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Preface 



This volume collects the proceedings of a conference on free boundary problems 
that took place in Trento (Italy) in June 2002. 

Many phenomena of interest for applications are represented by differential 
equations which are defined in a domain whose boundary is a priori unknown, and 
is accordingly named a “free boundary a further quantitative condition is then 
provided in order to exclude indeterminacy. Free boundaries are often related to 
discontinuities in constitutive relations; this raises a number of interesting math- 
ematical issues: existence of solutions in function spaces, uniqueness, regularity 
properties, numerical approximation procedures, and other questions have been 
extensively investigated. 

Examples of free boundary problems include the classical Stefan problem and 
more general models of phase transitions: here the free boundary is represented 
by the moving interface between the phases. Free boundaries also occur as fronts 
between saturated and unsaturated regions in filtration through porous media, 
between plastic and elastic phases in continuous mechanics, between magnetic 
domains in ferromagnetism, just to mention a few cases. Relevant examples also 
come from reaction-diffusion, fluid dynamics, biomathematics, and so on. 

Several of these problems are of interest for applications in industry and offer 
opportunities of collaboration among mathematicians, physicists, engineers, ma- 
terial scientists, biologists, and other researchers. A large community of applied 
scientists spread over the world has been formulating and studying these prob- 
lems for many years. They have also been meeting in major conferences every 
three years or so; the first appointment was held at Montecatini in 1981, and was 
followed by Maubuissons, Irsee, Montreal, Toledo, Zakopane, Crete, Chiba. The 
next conference is planned for 2005 in Portugal. The proceedings of these con- 
ferences provide a comprehensive picture of the development of research on free 
boundary problems in the last decades. 

The Trento conference saw the participation of more than 150 scientists, 
coming from a long list of countries. Talks included twenty plenary addresses, and 
seven focus sessions were devoted to selected topics: free boundary problems in 
polymer science, image processing, grain boundary motion, numerical aspects of 
free boundary problems, free boundary problems in biomathematics, modelling in 
crystal growth, and transitions in anisotropic materials. A poster session open to 
the contribution of all participants was also organized. Post-dinner entertainment 
included two concerts performed by professional and semi-professional musicians. 

Following the tradition of the series, the contributions concern problems 
which are either directly related to free boundaries, or may be so in perspec- 
tive. Special emphasis was put on interdisciplinarity and on issues of applicative 
relevance. This variety is obviously reflected in these proceedings, which are likely 
to meet the interests of a large spectrum of readers. 
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Structural Optimization by the 
Level- Set Method 

Gregoire Allaire, Francois Jouve, and Anca-Maria Toader 



Abstract. In the context of structural optimization, we describe a new numer- 
ical method based on a combination of the classical shape derivative and of 
the level-set method for front propagation. We have implemented this method 
in two and three space dimensions for models of linear or non-linear elastic- 
ity, with various objective functions and constraints on the volume or on the 
perimeter. The shape derivative is computed by an adjoint method. The cost 
of our numerical algorithm is moderate since the shape is captured on a fixed 
Eulerian mesh. Although this method is not specifically designed for topology 
optimization, it can easily handle topology changes. 



1. Introduction 

Shape optimization of elastic structures is a very important and popular field. 
The classical method of shape sensitivity (or boundary variation) has been much 
studied (see e.g. [13], [16], [20], [21]). It is a very general method which can han- 
dle any type of objective functions and structural models, but it has two main 
drawbacks: its computational cost (because of remeshing) and its tendency to fall 
into local minima far away from global ones. The homogenization method (see 
e.g. [1], [2], [5], [7], [8], [12]) is an adequate remedy to these drawbacks but it is 
mainly restricted to linear elasticity and particular objective functions (compli- 
ance, eigenfrequency, or compliant mechanism). Yet, another method has recently 
been proposed in [14], [18], [4] relying on the level-set method which has been 
devised by Osher and Sethian [15], [17] for numerically tracking fronts and free 
boundaries. The level-set method is versatile and computationally very efficient: 
it is by now a classical tool in many fields such as motion by mean curvature, fluid 
mechanics, image processing, etc. 

The paper [14] studied a two-phase optimization of a membrane (modelled by 
a linear scalar partial differential equation), i.e. the free boundary was the interface 
between two constituents occupying a given domain. It combined the level-set 
method with the shape sensitivity analysis framework. On the other hand, the work 
[18] focused on structural optimization within the context of two-dimensional linear 
elasticity. In that setting, the free boundary was the shape of the structure, which 
was captured on a fixed mesh using the immersed interface method. However, 
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[18] did not rely on shape sensitivity analysis: rather, the structural rigidity was 
improved by using an ad hoc criterion based on the Von Mises equivalent stress. 

In [4], we have generalized these two previous works in several aspects. We 
give here a brief review of our approach, based on a systematic implementation of 
the level-set method in which the front velocity is derived from a shape sensitivity 
analysis. We focus on shape optimization rather than two-phase optimization, 
and we replace the immersed interface method by the simpler “ersatz material” 
approach which amounts to filling the holes by a weak phase. This is a well- 
known approach in topology optimization, and it can be rigorously justified in 
some cases [1]. We compute a shape derivative by using an adjoint problem. Then, 
the shape derivative is used as the normal velocity of the free boundary which is 
moved during the optimization process. Front propagation is performed by solving 
a Hamilton- Jacobi equation for a level-set function. 

Acknowledgements: This work has been supported by the grant CNRS/ICCTI n° 
2002-12163 of the Centre National de la Recherche Scientifique (France) and the 
Instituto de Cooperagao Cientffica e Tecnologica Internacional (Portugal). 



2. Setting of the problem 

We start by describing a model problem in linearized elasticity. There is no concep- 
tual difficulty in choosing another model, and in particular a nonlinear elasticity 
problem. Let ft C (d — 2 or 3) be a bounded open set occupied by a li- 
near isotropic elastic material with Hooke’s law A. Recall that, for any symmetric 
matrix £, A is defined by 

+ \(Tr£) Id, 

where fx and A are the Lame moduli of the material. The boundary of ft is made 
up of two disjoint parts 

dft = r^v ur D , 

with Dirichlet boundary conditions on and Neumann boundary conditions on 
T jv • The two boundary parts T jj and T n are allowed to vary in the optimization 
process, although it is possible to fix some portion of it (see the numerical examples 
below). 

We denote by / ( g ) the vector-valued function of the volume forces (of the 
surface loads, respectively). The displacement field u in ft is the solution of the 
linearized elasticity system 

{ — div (A e(u)) = f in ft 

u = 0 onTjj (1) 

(Ae(u))n = g on Tat- 

Since ft is varying during the optimization process, / and g must be known for 
every possible configuration of ft. We therefore introduce a working domain D (a 
bounded open set of R d ) which contains all the admissible shapes ft. 
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To give a precise mathematical meaning to (1), we choose / E L 2 (D) d and g E 
H l (D) d and we assume that Tp ^ 0 (otherwise we should impose an equilibrium 
condition on / and g). In such case it is well known that (1) admits a unique 
solution in H l (Q) d . 

The objective function is denoted by J(£l). In this paper, we shall mostly 
focus on two possible choices of J (these are merely examples, and much more 
freedom is allowed). A first classical choice is the compliance (the work done by 
the load) 

Ji(ft) = / f’udx+ / g-uds= / A e(u) • e(u) dx , (2) 

J q J r n «/ f2 

which is very common in rigidity maximization. A second choice is a least square 
error compared to a target displacement 




k(x)\u — uo\ a dx 



1/a 



(3) 



which is a useful criterion for the design of compliant mechanisms [3], [19]. We 
assume a > 2, u 0 E L a (D) and k E L°°(D), k being a non-negative given weighting 
factor. In both formulae (2) and (3), u = u(Q) is the solution of (1). We define a 
set of admissible shapes which must be open sets contained in the working domain 
D and having a fixed volume V 

U a d = C D such that |f2| = V^. (4) 

Our model problem of shape optimization is 



inf J(H). (5) 

It is well known that the minimization problem (5) is usually not well posed on the 
set of admissible shapes defined by (4) (i.e. it has no solution). In order to obtain 
existence of optimal shapes, either some smoothness or geometrical or topological 
constraints are required. For example, a variant of (5) with a perimeter constraint 
turns out to be a well-posed problem (see [6]). More precisely, if t > 0 is a positive 
Lagrange multiplier, the minimization problem 

inf (j(Q)~\- f ds) (6) 

V Jdn J 

admits at least one optimal solution. There are other regularized variants of (5) 
which are well-posed and we refer to [9], [11] for such existence theories. Note 
that, although existence is not an issue of the present paper, we shall work with a 
smoother subset of (4), in order to define a proper notion of shape derivative. 



3. Shape derivative 

In order to apply a gradient method to the minimization problem (5), we recall a 
classical notion of shape derivative. This notion goes back to Hadamard, at least, 
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and several researchers have contributed to its development (see e.g. the reference 
books [16], [21]). Here, we follow the approach of Murat and Simon [13], [20]. 
Starting from a smooth reference open set 11, we consider domains of the type 

n e = (id + 0)(n), 

with 0 G W rl,00 (M rf , R d ). It is well known that, for sufficiently small 0, (Id + 0) is 
a diffeomorphism in R d . 

Definition 3.1. The shape derivative of J(O) at H is defined as the Frechet de- 
rivative J'(H) in W rl ’°°(M d ,M d ) at 0 = 0 of the application 0 — > j((Id + 0)(H)), 

i.e. 

J((ld + 0)(fi)) = J(0) + J'(fi)(0) + o(0) with lim^|l=0, 

where J'(fi) is a continuous linear form on VF 1,00 (E d , R d ). 

A classical result states that the directional derivative J' (H)(0) depends only 
on the normal trace 0 • n on the boundary 5H. 

Lemma 3.2. Let Q be a smooth bounded open set and J(H) a differentiable function 
at fi. Its derivative satisfies 

j'mvi) = 

for every 0i,02 £ W 1 , 00 (M d ;M d ) such that 02 — 0 i G C 1 (M d ;M oJ ) and 

0 i • n = 02 • n on < 9 H. 

We give two examples of shape derivatives that will be useful in the sequel. 
Lemma 3.3. Let H be a smooth bounded open set and f(x) G W 1,1 (M d ). Define 

J(H) = f f{x) dx. 

Jn 

Then J is differentiable at H and 

J' (H)(0) = f div(0(x) /(#)) dx = f 6 (x) • n(x) f(x) ds 

Jn Jon 

for every 0 G VF 1,00 (M d ; R d ). 

Lemma 3.4. Let fi be a smooth bounded open set and f(x) G W 2 , 1 (M d ). Define 

J(fi) = f f(x)ds. 

J dQ, 

Then J is differentiable at fi and 

J '^ = JjA¥n + Hf h 

for any 0 G VF 1,00 (E d ; R d ), where H is the mean curvature of <9fi defined by H = 
divn. 
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Remark 3.5. In particular, Lemma 3.3 is useful in order to compute the shape 
derivative of a volume constraint V (ft) = C. Indeed, we have 

V(ft)= [ dx and V'(ft)(6) — f 9(x)-n(x)ds. 

Jn Jan 

Similarly, Lemma 3.4 is useful in order to compute the shape derivative of a perime- 
ter constraint P(ft) = C. Indeed, we have 

P(ft)= [ ds and P'(ft)(0)= [ 6(x) • n(x)H ds. 

Jan Jan 



Theorem 3.6. Let ft be a smooth bounded open set and 6 E VL 1,00 (E rf ; R rf ). The 
shape derivative of ( 2) is 



Jan 



d(g ■ u) 

dn 



+ Hg • u 



— Ae(u) • e(u) 



6 • nds. 



( 7 ) 



The shape derivative of (3) is 
'd(g-p) 

, \ 

tan 



J'(O)(0) = [ 

Jat 



C { 



+ Hg • p — Ae(p) • e(u) + — k\u — uo\ a ) 6 • nds, 

a 



2v ““ /vv/ 1 * dn 

where u is the solution of ( 1), and p is the adjoint state, solving 



— div (Ae(p)) = Cok(x)\u — uq 

p = 0 

(Ae(p))n = 0 
where Co is a constant given by 



| a 2 (u — uo) in ft, 
onTch 
on V tv, 



Co 



[ k(x)\u(x)\ a dx 

Jn 



l/oi-l 



(8) 

( 9 ) 



Remark 3.7. Remark that there is no adjoint state involved in (7) (indeed the 
minimization of (2) is a self-adjoint problem). 



Proof Although Theorem 3.6 is a classical result (see e.g. [13], [16], [20], [21]) we 
briefly sketch its proof for the sake of completeness. To simplify we give a short, 
albeit formal, proof due to Cea [10]. We consider a general objective function 

J(ft)= / j(x,u)dx , 

Jn 

for which we introduce the Lagrangian, defined for (v,q) E (H 1 (R d ;R d )) 2 by 



C(ft,v,q) = / j(x,v)dx+ / Ae(v) • e(q) dx — / q • / dx 

Jn Jn Jn 

— q-gds— (q-Ae(v)n + v-Ae(q)n)t 
J r N Jr D v ' 



( 10 ) 



In (10), q is a Lagrange multiplier for the state equation and its boundary con- 
ditions. It is worth noticing that v and q belong to a functional space that does 
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not depend on ft, so that we can apply the usual differentiation rule to the La- 
grangian £. The stationarity of the Lagrangian is going to give the optimality 
conditions of the minimization problem. For a given ft, we denote by (u,p) a sta- 
tionary point for C. Taking the partial derivative of C with respect to q in the 
direction </> G and integrating by parts the resulting Euler condition, 

we obtain 



(^(n,u,pU) 



o 



J </> ■ ^div(.Ae(u)) + f^jdx 

/ </>■ ((Ae(u))n - g) ds 

JTn 

L 



(ii) 



u • Ae{(j))nds. 



First, choosing a test function (j) with compact support in ft, we deduce from (11) 
the state equation. Then, varying the trace function (j) on Tn gives the Neumann 
boundary condition for u , while varying the corresponding normal stress ( Ae((f)))n 
on gives the Dirichlet boundary condition for u. On the other hand, in order 
to find the adjoint equation, we differentiate C with respect to v in the direction 
This yields 

,dC. 



( 



dv 



(ft,u,p),<£) = 0 = / j' {u) • (j) dx + / Ae(4>) • e(p) dx 
Jn Jn 

^ p • Ae((f))n + 0 • Ae(p)n^ ds. 



-L 



Integrating by parts we obtain 
,dC 



(j'{u) - div(4e(p))) • (f>dx + (j> ■ (Ae(p))nds 



L 



p • Ae(4>)n ds. 



Letting (j) vary among the functions with compact support in ft gives the adjoint 
state equation 

— div(Ae(p)) = — j'(u ) in ft. 

Then, varying the trace of ^ on Tn yields the Neumann boundary condition 

(Ae(p))n = 0 on Tn- 

Finally, varying the normal stress (Ae(0))n on Tp gives 

p — 0 on V D . 



We have therefore found a well-posed boundary value problem for the adjoint state 

P • 

The shape derivative of the objective function is obtained by differentiating 
J(ft) = £(ft,w(ft),p(ft)), 
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which, by the chain rule theorem, reduces to the partial derivative of C with respect 
to Q in the direction 6 

BC 

j'(n)(0) = —(n,u, P )(9). 

Applying Lemma 3.3 and 3.4, we obtain 



dC 

dn 



(fl,u,p)(0) = 



J d ' n ( j(u ) + Ae(u) ■ e(p) - p- /)ds 

i 

-L 



9 -n [ + H g ■ p ) ds 



( 12 ) 



6 • n [ - — h Hh ) ds, 
k on 



with h = u ‘ Ae(p)n + p • Ae(u)n. Taking into account the boundary condition 
u = p = 0 on T o which also implies 



, t N , v du dp ( dn \ ( dp \ 



we deduce 

dC 

on 



(n,u,p)(9)= 9 ■ n ^j(u) + Ae(u) ■ e(p) - 

+ / 6 • n(j(u) — Ae(u) • e(p)^ ds. 

JYr> 



9(9 • P) 

dn 



on r^, 



Hg-p 



ds 



This proof is merely a formal computation (in particular we have to assume u and 
p to be differentiable with respect to the shape Cl) but it can be rigorously justified 
(see the references quoted above). □ 



Remark 3.8. We can generalize Theorem 3.6 to more general objective functions, 
including functions of the strain or stress. It is also possible to consider non homo- 
geneous Dirichlet boundary conditions in the state equation, or even a non-linear 
model of elasticity. 

Remark 3.9. It is possible to further restrict the class of domains by requiring some 
parts of the boundary T f ixe d not to move: specifically, the map 6 must belong to 

T a d = {0 G W 1,00 (R d ,M d ) such that 6 = 0 on Tf ixe d} . 



4. Front propagation by the level-set method 

Let a bounded domain D C R d be the working domain in which all the admissible 
shapes are included, i.e. Q c D . In numerical practice, the domain D will be 
uniformly meshed once and for all. Then, we shall capture the shape Q on this 
fixed mesh. To this purpose, we parametrize the boundary of Q by means of a 
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level-set function, following the idea of Osher and Sethian [15]. We define this 
level-set function ip in D such that 

{ ip(x) = 0 x e dftn D 
'ip(x) <0 ^ x e ft 

'ip(x) >0 o x e (D\ft) 



The normal n to the shape ft is yielded by V^/|V^| and the curvature H is given 
by the divergence of n (these quantities are evaluated by finite differences, since our 
mesh is uniformly rectangular). Remark that, although n and H are theoretically 
defined on dft only, the level set method allows to define easily their extension to 
the whole domain D. 

Following the optimization process, the shape is going to evolve according to 
a fictitious time, which corresponds to descent stepping (we shall come back to 
this issue in the next section). As it is well-known, if the shape is evolving in time, 
then the evolution of the level-set function is governed by a simple Hamilton- 
Jacobi equation. To be precise, assume that the shape ft(t) evolves in time t E R + 
with a normal velocity V(t,x). Then 

\p(t,x(t)^ = 0 for every x(t) E dft(t). 

Differentiating in t yields 

^ + x(t) -V x i/>=^+Vn- = 0. 

Since n = V^/IV^I we obtain 

^ + V|V*ifl=0. 

This Hamilton Jacobi equation is posed on the whole of the box D, and not only 
on the boundary dft , if the velocity V is known everywhere. Remark that the 
level-set method easily allows to compute the mean curvature H = divn (which 
plays an important role in a perimeter penalization). 

It is solved by an explicit first order upwind scheme (see e.g. [17]) 



M 

At 



+ min(V7\0) g~ (D+W, D~^) + max^", 0) 5 + (Z>+<. ^ C) = 0, 



with D+W = D x ^ ^ £ 1 , and 



# + (d + ,d ) = y/min(d+ , 0) 2 + max(d" , 0) 2 , 



g (d + ,d ) = ^/max(d+,0) 2 + min(d ,0) 2 . 

In order to regularize the level set function (which may become too flat or 
too steep), we re-initialize it periodically by solving 



+ sign(^) (jV^I - l) = 0 infix R + , 
^(t = 0, x) — ipoix) in D, 



( 13 ) 
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which admits the signed distance to the initial interface {^o(x) = 0} as a stationary 
solution. In numerical practice, re-initialization is very important because the level 
set function often becomes too steep, which implies a bad approximation of the 
normal n or of the curvature H. 

5. Optimization algorithm 

For the minimization problem 

inf ij(fi) 

QtzUad ^ 

we have computed a shape derivative 

J f (SY)(0) = I vB’inds , 

Jon 

where the function v is given by a result analogous to Theorem 3.6. Ignoring 
smoothness issues, a descent direction is found by taking 

0 = —vn. 

Therefore, we choose the normal component 6 'in = — v to be the advection velocity 
in the Hamilton- Jacobi equation 

- v\Vi[>\ = 0. (14) 

Transporting ^ by (14) is equivalent to move the boundary of £ 1 (the zero level 
set of 2 p) along the descent gradient direction — J'(£i). Finally, our algorithm is an 
iterative method, structured as follows: 

1. Initialization of the level-set function ipo corresponding to an initial guess 
Jio- 

2. Iteration until convergence, for k > 0: 

(a) Computation of the state u k and the adjoint state p k through two 
problems of linear elasticity. 

(b) Deformation of the shape through the transport of the level set func- 
tion: i(x) = ^(A£fc,x), where i/j(t,x) is the solution of (14) with 

velocity Vk = v(uk,Pk) and initial condition ^(O,#) = xjj k(x ). The 
time step A tk is chosen in such a way that J(D^ + i) < J(flfc). 

3. From time to time, for stability reasons, we also re-initialize the level set 
function ip by solving (13). 

In order to avoid an explicit meshing of the shapes we compute the state 
Uk and the adjoint state pk on the whole working domain D. For this, we fill the 
empty part D\Clk with a very weak material with Hooke’s law B = 10 _3 A and we 
perform the elasticity analysis on a fixed rectangular mesh on D (using Q 1 finite 
elements). Then, the Hooke’s law in D is 

a / \ _ f A where ^ k {x) < 0 

k\ x ) | 10 _3 A where ipk(x) > 0, 
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completed by a simple linear interpolation (proportional to the volume) of these 
values in the mesh cells where ipk changes sign. 

Since n and H, as well as the state u and the adjoint state p, are computed 
everywhere in D , the shape derivative (see formulae (7) and (8)) delivers a normal 
velocity — v which is defined throughout the domain D and not only on the free 
boundary dQ. Therefore, we do not ever have to know where the boundary di 1 
is precisely and we apply the same numerical scheme everywhere in the working 
domain D. 

Since the Hamilton- Jacobi equation (14) is solved by an explicit scheme, the 
time step A tk must satisfy a CFL condition. Remark that one explicit time step 
for (14) is much cheaper, in terms of CPU time and memory requirement, than the 
solution of the state equation (1) or the adjoint state equation (9). On the other 
hand, the usual time step A tk for (14) is often much smaller that the optimal 
descent step for the minimization of the objective function J(f2). Therefore, for 
each iteration k in the above algorithm (corresponding to a single evaluation of 
Uk and p*;), we solve several time steps of the Hamilton- Jacobi equation (14). The 
number of such time steps per iteration k is monitored by the decrease of J(fifc). 

Our algorithm never creates new holes or boundaries if the time step A tk 
does indeed satisfy a CFL condition because of the maximum principle for (14) 
(there is no nucleation mechanism for new holes). However the level set method is 
well known to handle easily topology changes, i.e. merging or cancellation of holes. 
Therefore, our algorithm is able to perform topology optimization if the number 
of holes of the initial design is sufficiently large (see Figure 1). The algorithm 
converges smoothly to a (local) minimum which strongly depends, of course, on the 
initial topology (see the differences in Figures 2 and 3). For a “good” initialization, 
the numerical results are very similar to those obtained by the homogenization 
method but the convergence is usually slower (although we have not yet tried to 
speed it by a quasi-Newton algorithm). 



6. Numerical examples 




Figure 1. Boundary conditions and two initializations of a 2-d cantilever 

We first consider a compliance problem and neglect body forces, i.e. / = 0 in 
(1). The objective function is a combination of the compliance and of the weight 
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of the structure 

J(fl) = I g-uds + i I dx , (15) 

J r tv J ^ 

where ^ > 0 is a Lagrange multiplier. We further impose that the Dirichlet boun- 
dary r D , as well as the portion of the Neumann boundary T n where the loads 
are applied, are kept fixed during the homogenization process. Thus, only the 
traction-free Neumann boundary is allowed to move. The boundary conditions 
and two initial configurations for a plane cantilever are displayed on Figure 1. 
The results are shown on Figures 2 and 3 for two different iteration numbers, 
showing the strong influence of the initialization. The convergence is smooth and 
fast (see Figure 4) and 20 explicit time steps of the Hamilton- Jacobi equation were 
performed at each elasticity iteration. 




Figure 2. Iterations 10 and 50 of the two-dimensional cantilever 
initialized as in Figure 1 (middle) 




Figure 3. Iterations 10 and 50 of the two-dimensional cantilever 
initialized as in Figure 1 (right) 

The advantage of the level-set method is that it can be easily extended to 
three space dimensions. A three-dimensional optimal electrical mast is shown in 
Figure 5 (see [2] for a precise definition of this test case). 

Next, Figure 6 shows a numerical result for the least square objective func- 
tion (3), where k(x) and uq(x) have been chosen in such a way that the jaws of 
the mechanism close. This is a classical gripping mechanism test case which is 
described, e.g., in [1], [19]. 
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Figure 4. Convergence of the objective function for the two- 
dimensional cantilever of Figure 3 




Figure 5. Three-dimensional electrical mast. 



We come back to the compliance objective function (2) with design dependent 
loads and take g as a uniform pressure load on the free boundary T Imposing 5 
Dirichlet points yields a nice sea star, as shown in Figure 7. 





Structural Optimization 



13 



iA(nil fewer I 

— n 

i 



Figure 6. Boundary conditions, initialization, and deformed op- 
timal shape of a plane gripping mechanism. 




Figure 7. Optimal shape under a uniform pressure load with 
five anchor points. 

Our shape optimization method can be easily extended to models of non- 
linear elasticity or to multiple loads problems. This issue, as well as the analysis of 
the effect of first-order or second-order discretizations and of re-initialization in the 
numerical convergence towards an optimal shape, are the subject of a forthcoming 
paper. 
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On a Variational Problem Arising 
in Image Reconstruction 

Luigi Ambrosio and Simon Masnou 



Abstract. We consider a variational approach to the problem of recovering 
missing parts in a panchromatic digital image. Representing the image by a 
scalar function u, we propose a model based on the relaxation of the energy 

Vu 



/ 



| Vix|(a + P 



div 



|Vu| 



OL,ft > 0, p > 1 



which takes into account the perimeter of the level sets of u as well as the 
L p norm of the mean curvature along their boundaries. We investigate the 
properties of this variational model and the existence of minimizing functions 
in BV. We also address related issues for integral varifolds with generalized 
mean curvature in L p . 



1. Introduction 

Several problems in digital image processing require the ability to recover missing 
parts of an image or to remove spurious or undesired objects. One can mention 
for instance the removal of scratches in old photographs and films, the recovery of 
pixels blocks corrupted during a binary transmission (or analogously the removal 
of impulse noise) or the removal of undesired advertisements, text or subtitles from 
a photograph. 

A digital image is usually modelled as a function u from a bounded domain 
of IR n (N = 2 for usual snapshots, N = 3 for medical images or movies, N = 4 for 
moving medical images) onto IR M (M = 1 for a grey-level image, M = 3 for colour 
images). Since it is now well admitted that the essential features of any natural 
image are contained in its grey level representation, we shall concentrate on the 
panchromatic case M = 1. 

After the work of L. Rudin and S. Osher [24], the usual representation of a 
panchromatic image is a sum of two components u\ G BV(IR N ) and u 2 G L 2 (IR N ). 
The component u\ is supposed to describe the geometry of the image, i.e. its 
objects and their boundaries, while u 2 contains all information about texture and 
additive noise. The assumption that the geometry of the image can be described 
by a function of bounded variation sounds quite natural, for it means that there 
can be discontinuities in the image but supported on rectifiable curves. The need 
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for another component not necessarily belonging to BV can be corroborated by an 
experimental procedure which seems to point out that, given a digital image, the 
subjacent “real” image may be often too oscillating to belong to BV (see [2] for 
the details and [11] for connected theoretic issues). The reader may refer to [3, 17] 
for a thorough survey of the space BV. 

In recent years, a wide literature that has been published in on the recovery 
of missing parts in a digital image: in this framework, one can basically distin- 
guish between two approaches and each of them corresponds in some way to the 
processing of one component in the decomposition above: 

• the stochastic approach, which is based on the modelling of an image as 
a realization of a random process. This modelling is particularly adapted 
for texture images (thus to the processing or the component U 2 in the 
previous decomposition) and has motivated numerous works on texture 
analysis and synthesis [13, 14, 20, 27, 28], 

• the deterministic approach, aiming at recovering the geometry of the im- 
age. Motivated by the seminal work of Mumford, Nitzberg and Shiota [23], 
several variational or PDE methods related to this approach have appeared 
very recently [7, 9, 10, 15, 21, 22]. 

Our purpose here is to discuss the mathematical aspects of a model from the 
latter category, which was introduced in [21]. Its underlying principle is to mimic 
a natural ability of the human visual system, the so-called “amodal completion” 
process, that has been described and studied by the Gestalt school of psychology 
and, particularly, G. Kanizsa [19]. In fact, our brain is able to reconstruct arti- 
ficially a partially occluded object by interpolating its edges by means of curves 
offering a compromise between shortness and good continuation. The model intro- 
duced in [21] is based on the following claim: one indeed assumes that, in order to 
recover the missing part of an image or, analogously, to remove a spurious object 
(both can be considered as occlusions) , it suffices to complete the level lines known 
outside the occlusion by means of short and not too oscillating curves that do not 
cross themselves or each other — and thus can be considered as level lines. Then 
the full image can be reconstructed, since it is well known that any Borel function 
is fully determined by its level sets. This constructive process, called disocclusion 
process in [21], can be written in a variational form: ideally, each interpolating 
level line should minimize an energy of the type 



l 



(a + PlK^dli 1 , p > 1, a,(3> 0, 



{u=t} 

which penalizes the length and the oscillations. After summing over all the levels 
and using the change of variables formula, we get the variational criterion 



F(u)= [ \Vu\(a + [3 

JA 



div 



Vu 

iv^i 



)dx, 



with the convention that the integrand is 0 wherever |V^| = 0. Our problem is 
now to minimize F over all the functions coinciding with the original image uq 
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outside the occlusion domain A. The integration domain A in F is an open set 
strictly containing A ; in this way, the interpolant is forced this way to verify the 
good continuation principle. Of course, F makes sense for smooth functions only, 
but it can be easily extended to L 1 by setting F(u) = oo for any u G L 1 . Then, as 
usual in the direct method of the calculus of variations, we introduce the relaxed 
functional associated with F, defined by 

F(u) = inf{liminf F(rqJ, Uh —> u G L 1 }. 

h— >oo 

Besides, it is straightforward to generalize F — thus F — to higher dimensions, in 
which cases it is no more associated to the minimization of f (a + over 

curves but to the minimization of f (a + /?|H| v )dTL l over hypersurfaces, where H 
denotes the mean curvature vector. 

This paper is organized as follows. In Section 2, we state a locality result for 
the mean curvature vector H of integral (N — l)-varifolds when HGL p ,p>iV — 1, 
p > 2 (Theorem 2.2). A direct consequence of this result is the lower semicontinuity 
(Theorem 2.4) of the functional 



/ (l + |H E HdW N_1 , p> N-1,N>Z 

JdE 

with respect to the convergence in L 1 in the class of sets E C IR N with dE G C 2 . 
This result extends to higher dimensions a previous result due to G. Bellettini, G. 
Dal Maso and M. Paolini [6] for N = 2, p > 1. 

Section 3 is devoted to the study of the disocclusion problem in dimension 
N. We prove the existence of an optimal solution (Theorem 3.1), the coinci- 
dence between F and the associated relaxed functional F for smooth functions 
(Theorem 3.2) and give some results on the regularity of the optimal solution in 
the particular case N = 2 (Corollary 3.4). For the sake of simplicity and with 
absolutely no loss of generality, we shall assume that a = (3 = 1 in the sequel. 



2. Locality of the mean curvature and applications 

The main result of this section is stated in Theorem 2.2, whose proof is based on 
the quadratic decay of the tilt-excess established by R. Schatzle in [25]. We recall 
his result below (the notation is the standard one in geometric measure theory, 
see [3, 16, 26]). 

Theorem 2.1 (Quadratic tilt-excess decay [25, Thm. 5.1]). Let U = y(M,0) be an 
integral N — 1-varifold in an open set Q C IR N with weight and mean curvature 
Hu e KM, p > N — l, p > 2. Then for fiu -almost allx G spt ix v , the tilt-excess 

tiltex M (x, p) := p l ~ N f \\T y M - T X M\\ 2 dp u (y) 

Jb^(x) 

decays quadratically, that is 



tiltex M (x,p) = O x (p 2 ). 
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Theorem 2.2 (Locality of the mean curvature). Let U = y(M,6u), V — v(A ^6y) 

be integral ( N - 1 )-varifolds in fl C IR N . If Hu G Lf oc (^ c/ ) and H v G Lf oc (/v) for 
some p> N — 1, p>2 then 

HuW-HvW 

for Tp- 1 -almost all x G M D N. 

Proof, (see [4]) It is based on the relationship between the first variation of a 
varifold, the approximate tangent space and the mean curvature, combined with 
Theorem 2.1 and finally on a result by Brakke on the orthogonality of the mean 
curvature vector [8]. □ 

Theorem 2.2 is a key point in the proof of the lower semicontinuity of the 
mean curvature’s L p norm, stated in Theorem 2.4 below, which is a generalization 
to the higher dimensional case of a previous result obtained in dimension two by 
G. Bellettini, G. Dal Maso and M. Paolini in [6]. We recall their result below. 

Theorem 2.3 ([6, Thm 7.1]). Let fl be an open subset of JR 2 , let p > 1 and let E 
be an open bounded subset of IR 2 such that dE fi ft G C 2 . Then 

[ (1 + \K,\ p )d'H 1 < liminf f (1 + \nh\ p )dTL l 

JdEnn h -^°° JdE h nn 

for any sequence (Eh)heJN of bounded open sets such that dE h fl fl G C 2 and 
Eh — > E in L 1 (fi) as h —> oo. 

Theorem 2.4 (Lower semicontinuity of the mean curvature’s L p norm). 

Let fl be an open subset ofJR N , N > 3, and p > N — 1. Let C 1R N be 

converging in L 1 (Q) to E, with dEh fi 0 G C 2 and dE fl Q G C 2 . Then 



(I + IHe^)^” 1 < liminf 



(l + |H Eh n^ 



where H E f resp . H Ehy ) denotes the mean curvature vector on dEC ifi (resp. dEhC \Q). 



Proof. Due to the lower semicontinuity of the perimeter, it is clearly sufficient to 
prove the part of the claim that involving the curvature. We can assume that the 
right hand side of the inequality to prove is finite, otherwise the result is trivial. In 
addition, possibly taking a subsequence, we can assume without loss of generality 
that 



sup / (1 + |H e J p )dH N 1 < C < oo. 

he in JdE h nn 

Let Vh = v(dEh H fl, 1) be the unit-density rectifiable (N — l)-varifolds associated 



with the sets EhHfl and let p Vh = Tt N ~ 1 \—dEh fl fl be the corresponding weights. 
By the divergence theorem, the first variation of the varifolds Vh in fl can be 
written as 

SVh(X) = - [ X ■ H Eh dii Vh , VX e 01(0, IR n ) 

Jn 

hence the L p norms of SVh with respect to fi Vh are uniformly bounded. 
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By Allard’s compactness theorem (see [1] or Theorem 42.7 in [26]), we obtain, 
possibly passing to a subsequence, that there exists a limit integral (N — l)-varifold 
V in such that Vh — ^ V and V = v(M, 0y), with M a countably W N -rectifiable 

set and 6y a positive integer- valued and locally W N -integrable function on fh 
As SVh = H Eh d/j, Vh — ^ <W, a well-known lower semicontinuity theorem (see for 
instance Example 2.36 in [3]) yields that SV = H v /V with H v E L p (fi v ) and 

[ 0y|H v | p dW N “ 1 < lim inf [ |H E J p dW N-1 . (1) 

JM h ~*°° JdE h nn 

Notice that so far we have just used the fact that p > 1. 

Now we show that W N_1 -almost all points in dE n £] belong to M and that 
H v coincides with the classical mean curvature H E for 7Y N_1 -almost every point 
of dEnQ whenever p > N — 1. Let x E dEnft. Since E has finite perimeter, for all 
r > 0 except possibly for a countable set, p v (dB^(x)) = 0, hence p v (B^(x)) = 
lim jji Vh (B?(x)) = lim H N - 1 (B^(x)ndE h nn) > W N_1 (Bf (x)n5Enfi), using 

h—* oo h—* oo 

the lower semicontinuity of the perimeter. It follows that x is a point where the 
lower (N — l)-dimensional density of p v is strictly positive. As \i v — 9yH N ~ 1 \—M, 
H n ~ 1 -almost every point with this property belongs to M. 

Let U := v (dE D f], 1) be the unit-density rectifiable ( N — l)-varifold in Q 
associated with dE D 0. By the divergence theorem we have 



Hu(x) = H e (x) 

for H n ~ 1 -almost all x E dE fl Q. Since dE D 0, E C 2 , it is easily seen that H n E 
L locil^u)- By Theorem 2.2 we obtain 



H e (x) = H u (x) = H v (x) 

for 7Y N_1 -almost all x E MndEnft and therefore for W N_1 -almost all x E dEilQ. 

Plugging this into (1) and using that Oy(x) > 1 for H N-1 -almost every 
x E M, we finally obtain 



L 



dEOQ 



|H e | p dH 



N — 1 



< [ |H V | p e v dH n - 1 

J dEHQ 



< lim inf f |H Bl .| p dW N_1 

JdE h 



and the theorem follows. 



□ 



Remark 2.5. The varifold arguments we use require the technical assumption p > 
2, which prevents the result by Bellettini et al from being a particular case of 
Theorem 2.4 whenever N = 2 and 1 < p < 2. 

Remark 2.6. Whenever H N ~ 1 (dEnfl) = lim^oo / W N_1 (<9E/ 1 n£7), the lower semi- 
continuity is true for any p > 1. This is an easy consequence of (1) and of Reshet- 
nyak continuity Theorem (see for instance [3, Thm. 2.39]), which implies that 

V = \(dEnn, 1). 
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3. Analysis of the disocclusion problem 

Let 0 be a bounded open set in IR^ (AT > 2) with Lipschitz boundary, representing 
the image domain, let A CC $2 be an open, connected set with Lipschitz boundary 
representing the occlusion and let uq £ BV(fi \ A) be the original image. By 
Theorem 3.87 in [3], for any A G IR the function given by = u$ and u\a — A 
belongs to BV(fi). This ensures that the occlusion can always be filled in. 

Let 0(0) denote the family of open subsets of 0. We consider the functional 
mapping L 1 (f2) x 0(0,) onto [0, oo] and defined for every (u,B) G L 1 (fi) x 0(0) 
by 

F p (u, B) := 

^ +oo otherwise 

with the convention that the integrand is 0 wherever \Vu\ — 0. 

The relaxed functional associated with F is defined for every (u, B) G L x (0) x 
0(0) by 

F p (u, B) := inf { liminf F p (uh , B) : u h ^ 

l h — »oo J 

Since F p (u, B) > f B \ Vu\ dx whenever u G C 2 (B ), the lower semicontinuity of the 
total variation yields 

F^(u,B) > \Du\(B) V(t/, B) G L\0) x 0(0). (2) 

In the sequel, we assume that there exist an open set A C 0 such that A 2D A 
and a function u G L 1 (fl) such that u = u 0 on 0\ A and F p (u, A) < oo. This could 
be considered as a mild regularity and compatibility condition between the image 
and the occlusion. 

Theorem 3.1. The problem 

Min { F p (u , A) : u = uq on 0\A} (3) 

has at least one solution u G BV(fl). 

Proof. Let (vh)he in C L 1 (fl) be a minimizing sequence. Without loss of generality 
we may assume that sup he]N F p (vh, A) < oo. Then (2) yields sup^ GlN \Dvh\(A) < 
oo, and therefore sup /lGlN \Dvh\(0) < oo because Vh = no on O \ A. Since the 
values of Vh are fixed on 0 \ A, the generalized Poincare inequality given in The- 
orem 5.11.1 of [29] ensures that the L 1 (fi) norms of Vh are uniformly bounded. 
Hence sup^ G]N ||f/J|BV(0) < and there exists a subsequence, still denoted by 
(vh)he in, converging in L 1 (fl) to a function u G BV(fl). Obviously, u = uq on 
0\A. From the lower semicontinuity of F p we finally obtain that 

F p (u, A) < liminf F p (vh , A) = inf {F p (v, A) : v = uq on 0\ A }, 



/' v “'( i+ 



div 



\7u 



|V«| 



^ dx if u e C 2 (B) 



and the theorem ensues. 



□ 
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Now we can show that the relaxed functional coincides with F p on C 2 func- 
tions. The proof is based on the geometric lower semicontinuity results of the 
previous section and on the identity 

F p (u,B) = [ [ (l + |H {u>t} \P)dH n ~ l dt Vu e C 2 (B). (4) 

JJR Jd{u>t} 

The above identity is a straightforward consequence of the coarea formula and of 
the equality H = — div( wherever |V^| > 0, see [3]. 

Theorem 3.2. Let B C ft be an open set and assume that N > 2 and p > N — 1. 
The functional F P (-,B) is lower semicontinuous on L 1 (f]) n C 2 (B) with respect to 
the L 1 topology. In particular 

F p (u , B) = Fp{u, B) Vu e C 2 {B). 

Proof. Let (uh)he in C L 1 (fi) D C 2 (B) be converging in L 1 (B) to u E C 2 (B) and 
set L := liminf F p (uh,A), assuming with no loss of generality that L < oo, that 

h — >oo 

the lim inf is a limit and that Uh converge a.e. to u. By the dominated convergence 
theorem, X{u h >t} X{u>t} h 1 L l (B) whenever {u = t} is Lebesgue negligible, 
hence for a.e. t G IR. In addition, by Morse Theorem, for almost every t E IR, 
{uh > t}, h G IN, and {u > t) have smooth boundaries. Therefore, by applying 
either Theorem 2.3 or Theorem 2.4 we obtain that 

[ (1 + < liminf f (1 + |H {Uh > t} n ^" 1 

J d{u>t}DB h-+oo J 0{ Uh >t}C\B 

for a.e. t E IR. Integrating over IR and using Fatou’s lemma, (4) yields 

F p (u,B ) < lim inf F p (uh,B). □ 

h — >oo 

In the two-dimensional case we can say something more about the structure of 
the solutions. The following theorem is easily deduced from the proofs of Theorems 
4.1 and 7.1 in [6]. In particular, it suffices to replace dE by d*E in the last part 
of the proof of Theorem 7.1, page 292. 

Theorem 3.3 ([6]). Let Q C IR 2 be an open set and p > 1. Let E be a Borel set 
such that there exists a sequence {Eh}he in of bounded open sets of class C 2 (fi) 
converging to E in L 1 (fl) and satisfying 

sup f (1 + \k\ p ) dH 1 < oo. 
he in JdE h nn 

Then E has finite perimeter in Q and there exists a locally finite family T = { 7 i}iei 
of regular curves of class W 2,p such that 

1 . d*Enn cU*=i(7i); 

2 . T is without crossings, i.e. whenever Xi{ti) = 7 j(^) £ ^ 

and ti, £2 € [ 0 , 1 ]. 
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Corollary 3.4. Let IR 2 be an open set and u £ L 1 (H) such that F p (u , B) < 

oo, with p > 1. Then, for almost every £ £ IR, there exists a locally finite family 
r* = {ji}iei t °f re 9 u ^ ar curves of class W 2,p such that d*{u > t} D B C \J ieIt (jj) 
and r* is without crossings. 

Proof. Let {uh}he in C C 2 (B) be converging to u in L x (f]) and a.e. and satisfying 
L := lim F p (uh,B) = F p (u,B ) < oo. Using Fatou’s Lemma and (4) we get 

h— -xx) 

[ lim inf [ (1 + |H {Uh>t} | p ) dW N_1 < 

JTR Jd{u h >t}nB 

lim inf / [ (l + |H {Uh>t ,nd?f N - 1 = L<oo, 

^°° ^ir 

thus lim inf / (1 + |H {Uh>t} | p )d7Y N_1 is finite for almost every t £ IR. 

Jd{u h >t}nB 

The conclusion follows by applying Theorem 3.3 for almost every t £ IR, possibly 
passing to a subsequence (depending on t). □ 

An obvious consequence of this result is that the same regularity holds for 
the boundaries d*{u > t] fl A (for almost every £ £ IR) of any solution of the 
disocclusion problem in IR 2 . In addition, since this regularity holds in A and not 
only within the occlusion A , it gives a necessary condition for the existence of a 
solution: namely, the level lines of the initial function uq must satisfy this regularity 
property, at least near the boundary of A. As a consequence, the only way — es- 
sentially — for these level lines to intersect on d A is to form a cusp point. 

Remark 3.5. This regularity result cannot be extended to higher dimensions, due 
to the fact that controlling the mean curvature does not necessarily yield the 
regularity of a hypersurface. By Allard’s regularity theorem (see [1] or Theorem 
23.1 in [26]), a (N — l)-varifold with density 1 and generalized mean curvature in 
L p , p > N — 1, is supported on a set that can be locally represented as the graph 
of a C 1 ’ 1 p~ function (J. Duggan [12] showed that a W 2,p regularity actually 
holds). Unfortunately, such regularity does not apply to the multiple density case 
anymore. An example is given in [8] of a varifold V with bounded mean curvature 
whose support contains a set A of strictly positive measure with the property that, 
if a £ A, then spt V does not correspond to the graph of even a multiple- valued 
function in any neighborhood of a. Thus, controlling the mean curvature only is 
not sufficient. 

On the other hand, it has been shown by J. Hutchinson [18] that if the second 
fundamental form of a varifold V is in L p , p > N — 1, then V is locally supported 

i i N ~ l 

on the graph of a multiple- valued C ’ p function. 

In our disocclusion problem, we can neither ensure that the varifolds sup- 
ported on the sets d{uh > t} converge to unit-density varifolds, nor that the 
second fundamental form is uniformly bounded in L p , except in the particular 
case N = 2 where the mean curvature coincides with the second fundamental 
form. This explains why our regularity result is stated only for N = 2. 
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Ill-Posed Hele-Shaw Flows 
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1. Introduction 

The Hele-Shaw flow, introduced at the end of the 19 th Century in [14], is a model 
approximation of the motion of two immiscible fluids confined between two parallel 
planes, separated by a small distance 0 < h <C L, where L is a characteristic 
transversal length. Such a device is called a Hele-Shaw Cell. Its physical interest 
rests on its formal analogy with Darcy’s law. This makes it suitable for designing 
simulations of the flow of immiscible fluids in a porous medium. 

One such experiment consists in sucking out through a nozzle a blob of a 
viscous fluid, say for example oil, surrounded by a less viscous fluid, say for example 
water or air (see Figure 1). The blob recedes and abruptly ruptures into disordered 
fingers. Reversing the role of the two fluids, one can pump a bubble of air into oil, 
attempting to remove the oil “at infinity” out of the Hele-Shaw cell (see Figure 2). 
As before, the bubble advances and ruptures abruptly into disordered fingers. A 
further experiment consists in injecting a “finger” of water into oil within an 
infinite strip of finite width (see Figures 3,4). While the finger advances smoothly, 
experimentally it always seems to choose a “preferred” configuration among all 
the admissible ones. Receding blobs and advancing bubbles rupture in a “too 
disordered” way. Advancing fingers evolve in a “too ordered” fashion. 

The mathematical interest of the Hele-Shaw flows is in understanding these 
phenomena. The common feature is that all of them are ill posed in the sense 
of Hadamard [13]. A direct relation to such an ill-posedness stems from their 
distributional formulation. Let u be the pressure of the more viscous fluid. The 
set occupied by this fluid is [u > 0] whereas the interior of set [u = 0] is the 
set occupied by the less viscous fluid. It turns out that all the aforementioned 
experimental settings can be translated into the distributional formulation, 

+ Au 3 0 in V (Hele-Shaw cell), (1) 

where H(u) is the Heaviside function of u. The natural initial datum associated 
with (1) is H(u(-, 0)), i.e., the geometrical configuration of the set occupied by 
the more viscous fluid at time t — 0. The formulation (1) represents a “forced 
backward” evolution of a process which is not reversible. This is the origin of 
the mathematical and physical ill-posedness of these problems. It turns out that 
solutions can be identified only for a very limited class of initial configurations 
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of the two fluids. The main mathematical problem consists in identifying such 
configurations. 

In this note we review these issues and point out the open problems. In § 2 
we derive the Hele-Shaw approximation starting from the Navier-Stokes equa- 
tions in the Hele-Shaw cell and we set up the basic formalism of free boundaries. 
In § 2.1 we give a first pointwise mathematical formulation of each of the three 
experiments described above and discuss their structures and differences. A com- 
mon distributional formulation is derived in § 4. The problem of receding blobs is 
examined in § 5 and the initial configurations yielding solutions are characterized 
as the coincidence sets of suitable variational inequalities. The problem of expand- 
ing bubbles hinges upon identifying the initial air bubble that would remove the 
oil out of the cell “at infinity”. Surprisingly such initial bubbles are homeoids , i.e., 
sets for which a uniform distribution of masses on their boundary would generate 
constant Newtonian potential inside their cavity. These results and such a connec- 
tion with potential theory form § 6. Receding blobs and expanding bubbles are 
reasonably understood [8, 9], while the problem of advancing fingers is not, and 
we concentrate on it in the remaining sections. 

The “experimental” finger (see Figure 3), chooses each time a symmetric 
configuration with an asymptotic width which is exactly one-half of the width of 
the channel. The physical mechanism by which such a selection occurs is not yet 
well understood. 

We first review recent results of [3] . The problem is transformed into a varia- 
tional inequality in an infinite strip and a precise asymptotic behavior is identified 
for a solution to exists. This asymptotic expansion implies uniqueness of solutions 
(modulo a choice of a coordinate system) and allows one to identify a peculiar 
quantity, which we call the area function , which might be used as a selection crite- 
rion of the “ physical ” solution among the admissible ones. Other possible criteria 
seem to be related to a motion by mean curvature of the free boundary and we 
discuss them in § 10. While supported by experimental evidence (Pitts [25]), a 
convincing mathematical explanation still eludes the researchers. 

In this note we have collected some unpublished remarks on the Hele-Shaw 
flow. Some of them pave the way to a unification of the theory, see sections 3 and 
4, and other ones, such as (18'), point to a novel way of formulating the problem of 
advancing fingers by means of a parameter S E (0, 1), weighting the contribution 
of the time-evolutions and of the travelling- wave structure of solutions. 



2. Hele-Shaw Cell [14, 19] 

A viscous incompressible fluid of kinematic viscosity //, velocity v and pressure p is 
confined within two parallel and slightly separated, horizontal planes 7r^, i = 1,2, 
at mutual distance 0 < h <C L, where L is a characteristic transversal length. 
Introduce Cartesian coordinates (x, y, z) in such a way that ni and 7r2 are identified 
with the planes {z = 0} and {z = h} respectively. The fluid is Newtonian and 
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locally governed by the Navier-Stokes equations. It is assumed that there are no 
external forcing terms, including gravity which is neglected, and that the velocity 
is small (|v| <C 1) and close to the steady state (v* « 0). Thus v satisfies the 
steady-state Stokes system 1 

— /iAv = Vp; divv = 0 for 0 < z <h\ 
v(>, y, 0; t) = v(x, y, h; 0) = 0. 

Since 0 < h <C 1 the vertical component v% of the velocity is negligible with respect 
to the horizontal components v\ and v 2 . Moreover the pressure p is independent of 
the vertical variable 2 . Since the term Av accounts for the internal friction of the 
fluid ([19]) and the motion takes place in ideal horizontal layers, the components 
Vi, xx and Vi^yy of the friction along the horizontal variables (x,y) are negligible 
with respect to the component V{ iZZ along the vertical variable. Combining these 
remarks, the Stokes system reduces to 



- pvi , zz =p x ; -HV 2 ,zz=Py', Vi,x + V2,y = 0 for 0 < z < h; 

Vi{x, y, 0; t) = Vi(x, y, h\ t) = 0 i = 1, 2. 



Multiply these by ( and integrate in d( over (0, z) for z G (0, h) and then integrate 
in dz over (0, h) to obtain, 



V = - 



12 p 



Vp 



where 



i r h 

Vi = - Vidz , I = 1,2. 

h Jo 



This is formally Darcy’s law for a fluid of viscosity /i, pressure p and fictitious 
porosity h 2 / 12. Incompressibility implies div V = 0 and A p = 0 within the fluid. 
This derivation is local in the interior of the fluid. A Hele-Shaw cell is the system 
of two slightly separated planes filled with viscous fluids whose dynamics has been 
so approximated. Because of the formal analogy with Darcy’s law, Hele-Shaw cells 
serve to simulate experimentally flows in porous media ([14, 19]). 



2.1. Immiscible Fluids in a Hele-Shaw Cell 

Two viscous, immiscible, incompressible fluids of kinematic viscosities //j, velocities 
Vj and pressures Pj, j = 1,2 are confined within a Hele-Shaw cell. A similar 
derivation in the interior of each fluid yields 



Vj = V<pj 
A cpj = 0 



in the fluids, where 



1 h 2 . 1 n 

= 1= ' 



( 2 ) 



The functions are the velocity potentials of the two fluids and each of them 
is defined within its corresponding fluid. Let tt(t) be the domain occupied by the 
fluid of viscosity /i 2 at each time t. Then the fluid of viscosity pi occupies the 
complement of Q(t) within the Hele-Shaw cell. The interface T(t) separating the 



X A, V and div are meant with respect to the space variables. 
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two fluids at time t is in local instantaneous equilibrium. Assuming that the surface 
tension is negligible amounts to, 2 



Pi — P 2 on the interface F(t). ( but not in general cpi = P 2 ) (3) 

If the motion evolves during some time interval (0 , T), for some T > 0, finite or 
infinite, set 

rv = U v(t), fl t = \^J 

te(o,T) te(o,T) 

Assume Ft is smooth and has a local intrinsic representation as <f>(x,y;£) = 0 in 
proximity of each of its points P Q = (x 0 ,y 0 ) at time t Q , for a function <$, smooth 
in the neighborhood of (P 0 ,£ 0 ), and such that |V$| ^ 0 on F. A material point 
P(t) = (x(£), ?/(£)) moving along its Lagrangian path starting from P Q at t — t Q , 
remains on F since the fluids are immiscible. Thus $(P(£); t) = 0 for t close to t Q . 
Computing the total derivative gives • P + = 0, i.e., 



N • P = —N f where N 






and N f 









(4) 



The function <I> is chosen in such a way that (N, N t ) points toward the fluid of 
viscosity fi\. The material point P belongs to both fluids so that P = Vj, j = 1, 2. 
Combining this with (2), we get 

V 1 -N = V 2 -N = -N u i.e., Vp! • N = A— Vp 2 • N = N t . (5) 

12/ii 12/i 2 



3. Receding Blobs, Expanding Bubbles and Advancing Fingers 

Assume that the Hele-Shaw cell occupies the whole 1R 2 and that the viscosity of 
one of the two fluids is negligible with respect to the other, say for example /i\ <C /12 
(for example air or water versus oil). Then /ii « 0 and from the definition of ip\ 
in (2) one has pi = 0. By (3) one has P 2 = 0 on F(t) for all t , and the motion is 
determined by f l(t) and the pressure P 2 within it for all t £ (0, T). Setting u = <p 2 , 
the problem consists in finding a family of sets (0, T) — ► Fl(t) C IR 2 with smooth 
boundary, and a measurable function u defined in IR 2 x (0,T), such that 

(x, y) — > u(x, y, t ) £ C 2 (Fl(t)) D C l for all t £ (0, T) 

and u satisfies 

Au = 0 in Fl(t); u = 0, Vu • N = — N t on F(t) Vt£(0,T). (6) 

The fluid is receding if C Ft(ti) for t\ < t 2 and it is expanding if the converse 
inclusion holds. 



2 Instantaneous equilibrium implies pi — P 2 = tk, where r is the surface tension and k, is the 
curvature of T(t). 
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3.1. Receding Blobs 

For all times the fluid occupies a bounded domain in 1R 2 and surrounds a “core” or 
sink G with smooth boundary <9G, through which fluid is removed (see Figure 1). 




Figure 1. Bubble of oil into water 



The suction occurs at constant rate Q > 0 and causes the fluid blob to recede. 
Denoting by v the unit normal to dG directed towards the fluid, 

Vu • v = — Q on dG (pressure forced to decrease along dG). (7) 

It is experimentally observed that the suction process ends with the abrupt forma- 
tion of irregular fingers in finite time. Repeating the experiments with the same 
initial configuration $7(0) of the blob (“same” within the limits of experimental 
errors), it is observed that the extinction time and the configuration of the ruptu- 
ring fingers are dramatically different. The problem is physically ill-posed in this 
sense. It is then natural to wonder whether there are initial configurations $7(0) of 
the blob for which the suction process ends smoothly at some finite time T, i.e., 
with no abrupt rupture into irregular fingers. More generally, if one prescribes a 
final configuration of fingers, is there an initial blob fi(0) for which the suction 
process ends abruptly by rupturing into that finger configuration? 

The boundary of Q(t) consists of two portions. T(£), the first portion, is 
the boundary separating the more viscous fluid from the less viscous one, and 
the second portion is dG. Since u is harmonic in Cl(t), vanishes on T(t) and has 
exterior positive flux on dG , by the maximum principle we get [u > 0] in the fluid 
and Q(t) = [u(t) > 0]. Assume that this problem has a smooth solution u , positive 
within the fluid, null on the interface T and such that \Vu\ 7^ 0 on T. Then, taking 
<f> = —u in (4) and in the third relation in (6), we have 

-\Vu\ 2 = u t on T(t) for all t G (0, T). (8) 

Since u t is harmonic in $7(£), it has zero flux on dG and it is negative on T, by the 
maximum principle u t < 0 in Cl(t) for all t € (0,T). Therefore the blob is receding 
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in the sense that Cl(ti) D £2(^2 ) for t\ < £ 2 - The mathematical formulation is 

An = 0 in Q(t); [ u(t ) > 0] = Q(t) £ (0 ,T); 

<u = 0 and Vu • N = — N t on T(t) Vt£(0,T); (9) 

Vu • v = — Q on dG for some fixed Q > 0 and \/t £ (0,T). 

The initial datum is the domain fi(0) occupied by the fluid, as the suction process 
starts. We stress that the initial datum is not the pressure w(0) at time t — 0, but 
the set where such a pressure is negative, i.e., |/u(0) > 0] = fi(0). The extinction 
time T is one of the unknowns to be determined. 

3.2. Expanding Bubbles 

Let (0,T) — > B(t) be a family of bounded, open sets in 1R N . For t = 0 the 
set L?(0) is bounded, open and connected and represents an initial bubble of air 
surrounded by fluid which entirely fills the remaining part of IR N (see Figure 2). 3 
The domain Q(t) occupied by the fluid is 1R N — B(t) and T(t) = dB(t). If the 




Figure 2. Bubble of air into oil 

bubble B(t) expands it pushes the fluid to |x| — > 00 . It is observed that the process 
ends abruptly with the formation of disordered fingers. In this case as well, the 
same bubble L?(0) (“same” within the experimental errors) may generate, in real 
world experiments, different extinction times and different finger configurations. 
A natural question is whether there exist bubbles B(Q) for which the whole fluid 
is pushed smoothly away to infinity. 

The potential u = ip 2 is harmonic within the fluid and the fluid “outflows at 
infinity” essentially like a Newtonian potential, i.e., 

Vu(x; t) = — Q VFjv(x) + 0(\x\~ N ) as \x\ — > 00 for all £e(0,T), (10) 

where Fn is the fundamental solution of Laplace’s equation in 1R N , N > 2 and Q 
is a positive given number. Thus the fluid tends to flow out to infinity. Since u — 0 
on T (t) and is harmonic in by the strong maximum principle u > 0 within 

3 Although the problem arises physically in ]R 2 we set it in R N in view of its connection with 
A-dimensional homeoids. In this context x = . . . ,xjy). More generally, one has a bubble of 

fluid of smaller viscosity fi\ which has been neglected. 
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the fluid and [u(t) > 0] = Q(t) for all t G (0,T). If the problem has a sufficiently 
smooth solution with smooth free boundary r(t), then |Vw| / 0 on T and one 
might take $ = u in (4) to find a relation identical to (8) in N space variables, 
and conclude that u t < 0. Thus the fluid is receding towards infinity and the air 
bubble B{t) is expanding. We have 

{ Au = 0 in Q{t) =1R N - B(t ); [u(t) > 0] = Q(t) \ft G (0 ,T); 

u = 0 and Vu • N = -N t on T{t) V*g(0,T); (11) 

\7u = — Q VFjv(x) + 0(\x\~ N ) as \x\ — > oo V£g(0,T). 

The initial datum is B(0), i.e. the domain occupied by the air bubble at time t = 0. 
Once again, the rupturing time T is one of the unknowns to be determined. 

3.3. Advancing Fingers 

Here the Hele-Shaw cell is an infinite strip S = 1R x (—1,1) and the fluid of 
smaller viscosity fi\ (water for example) penetrates the more viscous fluid (oil for 
example), in the form of a finger, whose asymptotic, vertical section, as x — > — oo, 
is an interval (—A, A) C (—1, 1) for some A G (0, 1). The shape of the finger and A 
depend on t. It is assumed that, at each time £, the penetrating fluid advances with 
constant velocity U i. Therefore its velocity potential is ipi = Ux + const , where 
U might depend on time. In this Subsection, we do not stipulate the assumption 
Pi = 0; let us note that even in the former cases it was not strictly necessary. 



J^V 




Figure 3. Symmetric Advancing Finger 



Let be the asymptotic velocity of the displaced fluid for x — > =boo. Since 
both fluids are incompressible, 

0 = [[ div V 2 dxdy = 2F+ - 2(1 - X)V 2 ~ + 2 f V 2 • N da 

J J oil Jr+ 



a. 



div Vi dxdy = -2A U -2 f V x • Nd<r. 

water J T+ 



Using the first relation in (5), this implies V 2 =U A + (1 — A)U 2 - By the second 
of (5), there holds as x — > — oo on T(t) 

<P 2 — ► —(fi, i.e., Vf = — U, and V 2 + = UX + ^-(1 - A )U. 

M2 M2 M2 

The velocity potential — Ux + const is only defined in the region occupied by 
water. We regard it as defined on the whole strip S by extending it to be the same 
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form. Then, motivated by (3), we set 

(l — — ^ u = - — -—(p 2 ~ Pi) — P 2 ~ —<fi on th e se ^ occupied by oil. 

V p 2 ) 12 M2 

The function u is harmonic on such a set and u = 0 on F. Moreover, 
lim Vu = XU i and lim Vu — 0. 

x — ► oo x — >• — oo 

From the first of (5), 

Vu-N = -N t on T(t) We(0,T). 

Since the edges of the cell are sealed, V 2 • j = 0 for y = dbl, i.e., u y (x, ±1) = 0 for 
all x E IR. Collecting these remarks, the problem reduces to finding: 

1. a domain ft C S x iR + whose boundary is a space-time surface T and 
whose t-sections T(t) are in the form of fingers; 

2. a function u(x,y,t) defined in Q and such that, 



Au = 0 in Q(t) VtE(0,T); 
u = 0 and Vu • N = —N t on T(t), \/t E (0,T); 

< u y (x,±l,t) = 0 for all x E IR and all t E (0,T); 

lim = (AC/,0) for all y E (-1,1) and Vt E (0,T); 

x — > 00 

lim Vu = (0,0). for all y E (-1, 1) and Vt E (0,T). 



(12) 



If u is a smooth solution of (12) with smooth T, by the maximum principle u > 0 
on ft and u — 0 on the free boundary T. Thus, by extending u to be null inside 
the finger, we may identify the domains occupied by the two fluids as 



[u — 0] = {water}, [u > 0] = {oil}, [u(t) > 0] = Cl(t) for all t E (0,T). 



The initial datum is Q(0) or equivalently the initial finger. Thus in particular 
the interval (—A, A) is prescribed. The mathematical formulation in (12) does not 
require the finger to be symmetric, nor does it depend on such a property. The 
problem could be mathematically posed as in (12) by starting with an initial finger 
whose asymptotic vertical section as x —> — 00 is a non-symmetric interval as in 
Figure 4, or even a measurable set E C (— 1, 1) of given measure p(E) = 2A. 




Figure 4. Non Symmetric Advancing Finger 




Hele Shaw 



35 



Denote by y — /(x,£) the local representation of the curve T (t), and let t be 
its unit tangent. The unit vector — (N, N t ) — — (N x ,N y , N t ) points inside the more 
viscous fluid. From the first of (5), Figure 3 and the definition of u , we compute 

Vu • N = UN X and N t = -UN X on T(t), V* G (0,T). (13) 

Therefore the third equality in (12) can be equivalently replaced by (13). 



4. Distributional Formulations 

These pointwise formulations differ from each other only on the requirements on 
the fluid set fi(t) and on the boundary conditions. However, they locally have 
the same structure. Namely, the fluid is identified as the set where u > 0, and u 
vanishes on the interface T(t) separating the two fluids. For this reason, the partial 
differential equation and the fluid set where it is required to hold are linked by the 
same distributional formulation which we derive next. We define 



H{u) 



1 

[ 0 , 1 ] 



if u > 0 
if u = 0. 



(14) 



If the time T, finite or infinite, has been determined, choose any 
ip G C 0 °° ({Hele-Shaw cell} x (0,T)) , 
and compute, in the sense of distributions, 4 

(A u, (p) = (u, A (/?) — uAipdxdt = uVcp • N da— 

J J Dj, J i/ r j 1 

// Vu'V(pdxdt = — 'Vu-'N(pda+ // A uipdxdt 

J J J J r ^ J J O'j 1 

= JJ N t ip da = J J <p t dxdt = (H(u),<pt) = -(J^H(u),vj. 

Therefore, 

d 



dt 



H(u) + Au = 0 in V' ({Hele-Shaw cell} x (0, T)) . 



(15) 



This includes the partial differential equation in (9), (11), (12), as well as the 
conditions on T(t) represented by the second relations of (9), (11), (12). The 
remaining conditions still hold and lead to the following distributional formulation 
of these problems. 



4 The Hele-Shaw cell could be the whole ]R 2 as in § 3.1, or the whole R N as in § 3.2, or the infinite 
two-dimensional strip S' as in § 3.3. The distribution pairings and calculations are formally the 
same for all these occurrences, x will denote a TV-dimensional variable as well. 
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4.1. Receding Blobs 

Although the physical problem arises in IR 2 we formulate it in 1R N since its math- 
ematical structure does not depend on the dimension. 

'«£ L 2 (0,T-,W 1 ' 2 (R N )nW 2 ’ 2 {[u{t) >0])); 

< j^ff(w) + Au 9 0 in V'(]R N x (0 ,T)); (16) 

Vi/ • z/ = — Q on dG for a.e. t G (0,T). 

Here Q is a given positive number and v is the unit normal to dG pointing towards 
the fluid. The inclusion in (16) is meant in the sense of the graphs. 

4.2. Expanding Bubbles 

'u 6 L 2 (o,T-w^(R N ) n w 2 ’ 2 ([u(t) > 0])); 

< j t H{u)+Au3 0 in V(lR N x (0,T)); (17) 

Nu = -QVFjv(x) + O (|x| -iV ) as \x\ — > oo a.e. t G (0,T). 

4.3. Advancing Fingers 

The Hele-Shaw cell is a 2-dimensional strip S and we will formulate the problem 
of advancing fingers in such a 2-dimensional setting. For n G IN let S n denote the 
truncated half strip (— oo,n) x (—1,1). Then we look for a function u : St — ► IR + 
such that, 

f u G L 2 (0, T; W 1,2 (S n ) D W 2,2 ([u(t) > 0]nS„)) Vn G IN; 

(u) + Au 9 0 in P'(Sx (0, T)) ; 

< u y (x,±l,t) — 0 for all x £ IR and all t £ (0 ,T); (18) 

lim Vw = (XU, 0) for all y £ (-1, 1) and V« € (0 ,T); 

X OO 

lim Vu = (0,0) for all y G (—1,1) and Vt G (0,T). 

< x — > — oo 

From (13) and the distributional computation (15), 

j t H(u) = ~U^H(u) in V'(S x (0 ,T)). (19) 

This implies that the second of (18) can be equivalently rewritten as 

r\ rv 

S^.H(u) - (1 - S)U-^H(u) + Au 9 0 
in V(S x (0,T)), for all 0 < <5 < 1. 



( 18 ') 




Hele Shaw 



37 



5. Receding Blobs 

Let u be a classical solution of (16). Since the fluid is receding, setting 

v(t) = ^ u(r)dr , one has H(u) = H(v). (20) 

Therefore (16) is transformed in terms of v as, 

Av G H(y) - £ where £ G H(u(T)). (20') 

The inclusion is in the sense of the graphs and (15) is required to hold for all 
t G (0, T). The time T is the extinction time so that u(-,T) = 0, smoothly or 
abruptly. The selection, 

a-) e H{u(;T)) = H( 0) = [0,1], 

can be interpreted as a description the extinction profile and it is called the ter- 
minal phase. Integrating (20') in dx over 0(t) gives, 

Q(T-t)\dG\= [ {H(v)-£}dx and QT\dG\ = [ {l-£}dx. 

Jn(t) o) 

This relation links the initial datum JI(0), the extinction time T and the extinction 
profile £. Given an initial blob O = 0,(0) and a measurable selection £(•) C [0, 1] 
in the sense of the graphs, we seek a positive time T and a non- negative function 
v such that 

'v e C([0,T] : W 2 ' 2 (n) n W 0 ll2 (fi)); 

A v£H(v)—£ a.e. inO, £ (0,T); 

< Vvu = -Q(T - t) on dG, Vi e (0, T); ( 21 ) 

v(x,T) = 0 and QT\dG\ = [ (1 -£)dx. 

Since the fluid recedes [v(t) > 0] = 0(t) C O for all t G (0, T). The flux condition 
on dG is meant weakly although it is shown in [8] that Vv is Holder continuous 
in O x (0,T), so that such a condition is actually classical. 

As for £, we may assign it in the following way. Select a measurable set E C O 
to serve as a description of the geometrical configuration of the final fingers and 
put £ = X£. The set E could be the finite or infinite union of disjoint subsets of 
O — G , each being a finger. If E = 0, then £ = 0 and the extinction occurs smoothly 
without abrupt formation of fingers. 

5.1. Relating the Initial Blob to the Final Finger 

Given a final finger configuration £, we wonder whether there exists an initial blob 
0 and a time T such that the suction process ends at time T with the formation 
of the terminal phase £. Such initial configurations fl(0), if existing, are confined 
within a rather small class of sets, which we will now identify. 5 



5 One can expect the initial configurations f2(0) to be in a narrow class by virtue an heuristical 
argument of analogy with the backward heat equation. If in (16) H(u) = u, one would have 
the backward heat equation, and fixing a “final” datum u(-,T), one would ask for the “initial” 
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Lemma 5.1. //{£!, T, £, u} is a weak solution of (21), then Vv(-,0) -N = 0 on d£l, 
where N is the outward unit normal to <9£1. 



Proof. Since v(-,0) > 0 in £1 and it vanishes on <9£1, one has Vv(-,0) • N < 0 on 
<9£1. Integrating the equation in (21) for t — 0 gives 

0 > / Vv(x, 0) • Nda > 0. 

Jon 



□ 



From this we deduce 

Corollary 5.2. // {£1,T, £, v} is a weak solution of (21), then V = v(-,0) is the 
unique solution of the variational inequality, 

AV < 1 — £ 

E>0 

V(AV-l + £) =0, 

\yv v = -QT on d G. 

Moreover, £1 = [V > 0] . 

The Corollary asserts that if (21) has a solution, then the initial blob must be 
concentrated on the support of the solution of (22). The solution V determines its 
own support from (22). Thus £1(0) cannot be prescribed in an arbitrary manner. 

The following Theorem, proved in [8], establishes the converse. 

Theorem 5.3. ([8]) Fix £ = yx: and A > 0 and let £1(£, A) be the open set [V£ ? a > 0] 
where is the unique solution of (22) for £ and for QT = A. Then there exists 
a unique solution {£1,T, £,v} of (21) with £1 = £1(£, A) and T = X/Q. 

This characterizes completely the class of initial domains for which (21) ad- 
mits a solution. Given any “terminal finger” E and the corresponding solution of 
(22) for £ = xs and some A > 0, the fluid recedes to the finger E in time T = X/Q 
if and only if the initial blob is confined in the complement of the coincidence set 
of the variational inequality (22) for this choice of parameters. 

If E = 0, i.e., if £ = 0, then the variational inequality (22) gives a way of 
choosing the initial blob for which there is smooth extinction, in time T, with no 
finger formation. We stress that the smooth extinction and the smoothness of the 
initial blob £1(0) are not related. Only the relations in (22) with £ = 0 prescribe 
the structure of the initial blob £1(0) in such a way that smooth extinction may 
occur. In [8] we give examples showing that any finger configuration, however 
irregular, can be achieved starting from an initial blob £1(0) whose boundary is 
nearly spherical and analytic. 



a.e. in 1R N — G 



(22) 




datum u(-,0) yielding the prescribed configuration u(-,T). Such a datum, if any, would have to 
analytic in Q. 
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6. Expanding Bubbles 

Let E be the final configuration of the set occupied by the fluid; equivalently, 
B(T) = 1R n - E is the final configuration of the bubble at time T. Such extinc- 
tion/rupture time T might be finite or infinite irrespective of the regularity of E. 
For example if E = {|x| > R}, the fluid might be smoothly displaced along of a 
sphere {|x| = r(t)}, and it might abruptly disappear at some finite time T when 
r(T) = R. In the other extreme case E = 0, the bubble smoothly displaces the 
fluids for all times and it invades the whole IR N as t — > oo. The problems (17) for 
finite or infinite T have a different nature. 

6.1. Expanding Bubbles Bursting in Finite Time T 

Having fixed a final configuration E, we let £ = xs and we keep on denoting by 
£1 = £1(0) = IR n — B( 0) the initial fluid set for the initial bubble B( 0). Defining v 
and H(v) as in (20), the new unknown v satisfies (20') in V r (IR N ) for the choice 
£ = xe- Integrate this equation over a ball B p of radius p centered at the origin, 
use the asymptotic condition in (17), and let p — > oo and £ — > 0 to obtain, 

QT = [ (1 - £)dx = |£1 - E| = | IR N - E| - |B(0)|. (23) 

Jn 

The initial bubble B( 0) is confined within a ball {|x| < R} centered at the origin. 
Then, for a burst or extinction in finite time the measure of 1R N — E must be 
finite. Summarizing, given a final configuration £ = and an extinction time T 
one seeks an initial configuration £1 of the fluid, or equivalently an initial bubble 
B{ 0) = IR n — £1, and a non-negative function v such that 

veC([0,T};W^(m N )),v>0; 

Av e H(v)-£ a.e. in M N , Vt G (O.T); 

Vn = -Q(T-i)VF A r(x) + 0(|xr iV ) as |x| — > oo, W € (0,T); ( 24 ) 

v(x,T)=0 and QT = [ (1 - £)dx = - E|. 

J n 

If v is a weak solution, B(t) = [v = 0] for all t G (0, T). Moreover, 

a = [V > 0] where V = v(-,0), and A = QT = \Q - E|. 

Arguing as before, V is a solution to the variational inequality, 

' ( AV < 1 - f 
J V >0 

' lv r (AV-l + 0=0, 

VV- = -AVF n (x) + 0(|ar| _iv ) as |x| -» oo. 

Theorem 6.1. [9] For every fixed final configuration E and A > 0, there exists a 
unique solution of the variational inequality (25). As a consequence, there exists a 
unique weak solution of (23) with T — X/Q. 
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Thus, given a final fluid configuration £ and A > 0, the expanding bubble 
problem (23) has a solution with QT = A|fl — E| if and only if the initial bubble 
H(0) is the coincidence set of the variational obstacle problem (24). In such a case, 
f 1 = 1R n — B( 0). The proof in [9] makes use of potential representations in terms 
of the fundamental solution Fn(-). The proof also shows that 

v(x,t) = [ {F N (y - x) - F N {x)}dy. (26) 

J o-s 

Moreover B(t) = [v = 0] for all t G (0,T), and, 

\B(t)\ = |B(0)| +Qt for all t G (0,T). (27) 



6.2. Extinction in Infinite Time 



The same transformation as in (20) has compactness problems; also (23) is not well 
defined. However, we can carry out the same procedure to construct approximating 
solutions. Fix an asymptotic fluid configuration £ such that \IR N — £| = oo. Then, 
for n G IN set 



E n = £u{|x| > n}, £ n = Xx:„, A n = \1R N - E n | - ji, (28) 



where /x is a fixed positive parameter to be chosen. For these choices, one solves 
the approximating problem, 



'v n eC([0,T\;W™{m N )), v n >0 ; 

Av n G H(v„) - £„ a.e. in JR N , 'it G (0,T n ); 

< Vn„ = -Q(T n - f)VFjv(x) + 0(|x| _N ) as |x| — > oo, it £ (0 ,T); 
v n (x,T n )= 0, T n = X n /Q, O n = IR N - E n - B n (0); 

QT n — [ (1 - £ n )dx = |f2 n - E„|. 

Jn ri 



This is a family of variational inequalities (parametrized with t G (0, T n )), with 
coincidence sets B n (t) = [v n (t) = 0]. Moreover, Q n = [u n (-,0) > 0]. For each 
n G i/V, this problem has a unique solution v n , which in addition satisfies, 

V n (x,t) = [ {F N (y - x) - F N (x)}dy ite(0,T n ). (30) 

J R N — T, n — B n (t) 



Also, 



\B n (t)\=fi + Qt Vt G (0, T n ). (31) 

We briefly comment on the positive parameter /x. Comparing (31) with (27) and 
keeping (23) in mind, such a parameter prescribes the measure of the initial bubble 
B n ( 0) and keeps it fixed as n — > oo. Notice that only the volume and not the shape 
of the initial bubble are specified. Thus when n — > oo in (29) one expects to get 
a solution of the expanding bubble problem with infinite extinction time and for 
a prescribed volume of the initial bubble. Such a solution will then depend on an 
arbitrary positive parameter fx. This dependence is not surprising since T = oc 
and it corresponds to an arbitrary choice of the initial time, as it appears from (27) 
and (31) for n — > oo. Letting n — > oo in (29) hinges upon some a priori estimates 
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on { , u n }, which are derived in [9] under the assumption that the fingers in E are 
not too “thick” , in the sense 



jf 



|En{|x| = p}| 



dp < oo. 



a KM = p}l 

Assuming such a “non-thickness” condition of the terminal phase E, the sequence 
{v n } is equi-bounded in C^(1R N ) and a strong pointwise limit can be taken in 
(29). Referring to [9] for the technical arguments, the resulting limiting problem 
is 



'veC(M+-,W^(]R N )),v>0- 

< Av g H(v) - £ a.e. in IR N , Vt G 1R + ; (32) 

K v(x,t) < C\x\ 2 as |x| — > oo, G iR + , 



for a positive constant C independent of t. The last two of (29) lose meaning as 
T n — > oo, whereas the asymptotic behavior as |x| —> oo yields the last of (32). 
This is established in Theorem 4.5 of [9], page 39, by a suitable expansion of 
Fjv in terms of harmonic polynomials as \x\ — ► oo. This limiting problem is a 
family of variational inequalities parametrized by t G iR + and for each t the 
coincidence set is B(t) = [v(t) = 0]. By a classical stability result of coincidence 
sets of variational inequalities, the Hausdorff distance between B n (t) and B(t) 
tends to zero as n — > oo, for all t > 0. It follows from this fact and from (31) that 



\B(t)\ = p + Qt for all t > 0. 



(33) 



The variational inequality (32) for t = 0 completely characterizes the initial bub- 
bles B( 0), of measure //, yielding the terminal finger configuration E in infinite 
time. These are the coincidence sets of the solution of (32) for t = 0. 



6.3. Smooth Extinction in Infinite Time 

An interesting characterization of the initial bubbles B = B(0) can be given in 
the case when the fluid is displaced to |x| — » oo smoothly at all times. Thus, there 
are no terminal fingers and E = 0 and £ = 0. Accordingly, 1R N — E n is the ball 
{|x| < n} and (30), for t = 0, takes the form, 

V(x)= lim f {F N (y - x) - F N (y)}dy (y = v(-,0)). 

n ^ CO J{\y\<n}-B 

Write this for x e B so that 

lim [ {F N (y - x) - F N (y)}dy = 0. (34) 

n ^ 00 J{\y\<n}-B 

Coming back now to (29), we write it for t = 0 and V n = v n (-,Q). In the resulting 
problem, we perform the change of variables x — > x/S for a parameter 5 > 1 and 
we set 

V n ,5 = d 2 V n (x/S), T n j = 6T n , B Ui s = SB n ( 0). 



Vnj(x) = [ 

J {ly^ndj-SB^iO) 



{F N (y -x)- F N (y)}dy = 0. 



Then, 




42 



D. Andreucci, G. Caruso, and E. DiBenedetto 



Computing this for x E B(0) and letting n — > oo gives 

lim f [F N {y - x) - F N (y)}dy = 0. 

Thus, taking the difference with (34), 

[ {F N (y - x) - F N (y)}dy = 0, V<5 > 1. (35) 

J5B-B 

This is a geometric condition on the initial bubble B — B( 0) for a smooth ex- 
tinction to occur in infinite time. It says that the Newtonian potential generated 
by a uniform distribution of masses on the “shell” SB — B is constant inside the 
cavity B. Such domains are called homeoids (see Figure 5). A theorem by Newton 




Figure 5. Constructing Homeoids 



asserts that ellipsoids are homeoids ([17]). 

The converse is also true, that is homeoids are ellipsoids. The proof for N = 2 
is due to Kellogg [17] and in dimension N = 3 the proof is given in Dive [10]. The 
proof is calculational, in the sense that makes extensive use of potential representa- 
tions. A different approach is taken in [4], where a proof is developed by exploiting 
a structure result for solutions to variational inequalities. It is shown in [9] by an 
indirect argument that the statement is indeed true in every dimension N. Thus 

Theorem 6.2. Every N -dimensional homeoid is an ellipsoid and viceversa. 

Returning to the Hele-Shaw problem of expanding bubbles, we can conclude 
that if the initial bubble is a homeoid, then the fluid is removed smoothly in infinite 
time. Viceversa, if the extinction occurs smoothly in infinite time, then the process 
must have started from a homeoid. Equivalently, 

Theorem 6.3. A classical solution of the Hele-Shaw problem in 1R N of displacing 
a fluid by an advancing bubble and with empty final phase exists if and only if the 
initial bubble is an ellipsoid. 
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7. Advancing Fingers and the Selection Problem 

We look for a travelling wave solution of the form u(x — Ut) for the problem of 
advancing fingers in its classical formulation (12)— (13). Relabelling by x the new 
variables, (12) reduces to finding a domain ft C S whose boundary is a curve T in 
the form of a finger and a function u defined in ft and such that 

r Aw = 0 in ft, and u = 0 on T; 

Vu • N = —U cos 6 on T; 

< u y (x, ±1) = 0 for all x G IR; (36) 

lim Via = (AC/, 0) 

lim Vw = (0,0). fOT aU V £ ( -1, ^ 

X — ► — OO 

Experiments consist in injecting water into an oil-filled Hele-Shaw cell from one 
of its “sides”, say at x = — oo. It is observed that after an initial formation of 
several incipient and invading fingers, the penetrating fluid reaches a steady state 
and takes the form of a single, symmetric finger as in Figure 3. Thus the formu- 
lation (36) can also be regarded as a steady state configuration of (12). The weak 
formulation of (36) is derived from (18) by either seeking traveling wave solutions 
of (18) or by imposing (19) and seeking steady state solutions. In either case, its 
weak formulation takes the form, 

9 u € W^ 2 (S n ) H W 2 ’ 2 ([u > 0] H S„); 

A 

U -^H(u) - Au 3 0 in V'(S); 

< u v (x,±.\) = 0 for all x € 2R; (37) 

lim Vu = (XU, 0) 

X lim V« = (0, 0). all y €(-1,1). 

v x — ► — oo 

Assuming that the finger is symmetric, Saffman and Taylor [29] produced an im- 
plicit family of solutions of (36) parameterized by A G (0,1), where 2A is the 
asymptotic width of the finger as x — > — oo. The equation of the curve limiting the 
finger can be computed explicitly as 

x (y) = ( 1 + cos t )}' ( 38 ) 

More generally, one might prescribe a generic interval of width 2A, not necessarily 
symmetric about y — 0, with horizontal bisecting line y — y 0 7^ 0, say for example, 

{a, (3) C (-1,1), (/?-<*) = 2A; a = y Q - A, (3 = y Q + A, (39) 

and solve (36) by requiring the asymptotic cross section of the finger to be the 
interval (a,/3). If (3 ^ —a the finger would not be symmetric and its nose would 
be located at some (£*,y*) ^ (0,0) (see Figure 4). Using A and y Q as parameters, 
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Saffman and Taylor [29] computed explicitly such profiles as 



x(y) = - — - In 



7 r 






1 4- cos 



n(y - y 0 ) 

x 



+ — In ^ 

7T 



. . . * y- y 0 } 

1 + Sin 2 A 

1-sin?^ 



} (40) 



In this formula, the origin of the x-axis is taken in such a way that (0, y Q ) is on 
the finger, even though not necessarily coinciding with its nose. For y 0 = 0, this 
reduces to (38). Saffman and Taylor computed solutions for non-symmetric fingers 
as the limiting case of a deformed and elongated bubble moving in the Hele-Shaw 
cell. However, asymmetric fingers are not observed in experiments. The fingers 
occurring in experiments are symmetric and, for a wide range of the asymptotic 
speed U, their observed asymptotic width is \ of the width of the channel. In our 
setting A = | (see [29, 30]). Thus it appears that out of the families (38) and (40), 
Nature selects the symmetric profiles with the parameter A = 

The ill-posedness of this problem is of different nature than for receding blobs 
(16) and expanding bubbles (17). In the context of advancing fingers, families of 
solutions can be found in an explicit form but one does not know which one is the 
“good” solution. The physical principle that would favor the solution correspon- 
ding to A = \ is not yet fully understood. 

The original approach of Saffman and Taylor was based on conformal map- 
pings and separations of variables in the conformally transformed problem. Most 
of the investigations on this problems have followed the same pattern or are of 
a numerical nature. The wide range of analytical and numerical attempts to the 
problem ([1,4,10,13,14,17,19]) confirms that the symmetric solution A = ^ plays 
some special role. Nevertheless, to our knowledge a clear and simple principle ex- 
plaining the ill-posedness of the problem still eludes the efforts of the investigators. 

A different approach is adopted in [11], where attention is drawn to a selection 
problem posed in a semi-infinite domain, essentially obtained assuming that the 
displaced fluid has a second free boundary on the right, open to the atmosphere. 
It is shown in [11] that the only admissible translating solution to this problem 
corresponds to the A = 1/2 Saffman-Taylor curve. The author argues that the 
case of the right bounded strip is the one physically relevant, and that an infinite 
strip does not represent the experimental setting of a very long body of fluid. 

We have approached the problem from a more theoretical point of view along 
the lines of [8, 9]. While far from having solved the problem, we have identified 
elements of the asymptotic behavior of the solution as x — > oc which seem to have 
physical significance and which are maximized for A = 



8. Transforming the Problem into a Variational Inequality 

To investigate how the asymptotic shape of the finger as x — » — oo affects the 
solution, prescribe any measurable set E C (—1,1) with \E\ = 2A for some A E 
(0, 1) and attempt to solve (37), by requiring the domain occupied by the water 
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to have asymptotic x-section Basx^-oo. We also ask the set occupied by the 
less viscous fluid to be finger-like , i.e., for all 77 G (—1,1) the line y = 77 intersects 
the boundary of the set [u > 0] in at most one point. Any collection of fingers is 
a finger-like domain. Nonetheless, the notion is more general as it is independent 
of the geometry and the regularity of the free boundary [u = 0]. In particular, it 
may include any disordered penetration of water into oil, provided the penetration 
occurs in the direction of the x-axis. Assuming a smooth solution exists, set 

v(x,y) = y f u{s,y)ds, and A = (-1, 1) - E. (41) 

The finger-like structure of the free boundary implies that v is positive or zero 
whenever u is positive or zero. Thus, by formally integrating the equation in (37) 
over (— oo,x) we obtain the problem examined in ([3]) 

' v G W 2,2 (5 n ); v > 0 in S n for all n G IN; 

Av G H(v) - XAxR in V'(S); 

\E\ = 2A > 0 with 0 < A < 1; 

< v y (x,±l) = 0 for all x G 1R; (42) 

lim sup v(x, y) = 0; 

x -*-°°\y\<i 

sup v(x,y) < Cx 2 for x 1, 

<\y\ < 1 

for some positive constant C. The problem has a formal analogy with (32), i.e. a 
bubble expanding smoothly into oil, for all times with no rupture. In particular 
the last of (42) exhibits the same upper bound on the growth of the solution 
as the last of (32). The basic difference is the form of £. In (32) it represents 
the characteristic function of the terminal set occupied by the fluid. In (42) it 
is the characteristic function of the complement of the smallest horizontal strip 
containing the finger. While the set A x 1R is occupied by the fluid, it is not the 
whole fluid set. All the requirements in (42) are essential. In [3] counterexamples 
are given showing that if any one of them is removed (for example positivity, or 
A = 0), the resulting problem while admitting a solution does not translate the 
physical information contained in (36)-(37). In particular the requirement |J57| > 0 
allows one to generate solutions with a “finger” . 

Theorem 8.1. There exists a solution to (42). The solution is unique up to hori- 
zontal translations. 

Construction of solutions hinges upon a priori estimates and upon identifying 
the finger through the abscissa x* of its “nose” . This is defined as 

x* = inf {x G IR | w(x,y) > 0, for all y G (—1,1)} . 

It turns out that the requirement \E\ > 0 is essential to identify x*. It is shown in 
[3] that no solution exists if \E\ = 0. Thus the Hele-Shaw problem of advancing 
fingers has a solution if and only if | jE7| > 0. However, the physical problem is 
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invariant by an horizontal shift of the x-axis. Equivalently, it is independent of the 
abscissa of the “nose” of the finger, provided one can be found. Thus, uniqueness 
up-to-translations is the only correct notion of uniqueness. 



9. Asymptotic Behavior and the Area Function 



Uniqueness requires a precise description of the asymptotic behavior of the so- 
lutions of (42) as x — > oo, in terms of the notion of area function, which we 
describe next. Choose a coordinate system so that the nose of the finger has coor- 
dinates (x*,y*). The notion of area function is best understood in the context of 
the Saffmann-Taylor solutions (40) originating from an asymptotic cross section 
E = (a, ft) as in (39). The area function as a function of x is the measure of the 
shaded region in Figure 6 when computed at the abscissa x* of the nose of the 
finger. 



E 



water 



{£*>$/*) 



x 



oil 



-1 



Figure 6. The Area Function 



For a general asymptotic cross section E C (—1,1), the area function is 
defined by 

a(x) 

a(x*) + (x — x*)\E\, if x > x *. 



*/ / 

JE J-c 



X[w>o\(s,y)dsdy if x < £*; 



(43) 



Such a function depends on the geometrical configuration of the finger only, and 
not on the values of the solution w. It can be determined from the knowledge of 
w from —(X) up to the tip of the finger. The values of w for x > x* are immaterial 
for the determination of the area function. A physical interpretation of the area 
function can be drawn from the formula 
1 /•! 

w x (x,y)dy = / u(x,y)dy for all x G 7R, 

l J - 1 




which is established in Corollary 7.4 of [3]. Interpreting u(x,y)dy as an elemental 
force acting in the x-direction, this formula suggests that a(x) is the thrust exerted 
by the fluid across the x-section in the direction of the advancing finger. Set also 

/ x \ [ y 

a{s)ds ; g(y) = g(-l) - - (i + yf + J {y - v)XE(v)dv, ( 44 ) 



where g(— 1) is a constant chosen in such a way that g(-) has zero average over 
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Theorem 9.1. Let w be a solution of (42). Then, for x^> 1, 

w{x,y) = ~{K X ~ x *f + a{x*)(x - x*) + 6(x*)} + g(j/) + 0(1). (45) 

Analogously, let u = w x be a solution of (37). Then for x 1, 

u(x, 2/) = A(x - x*) + ^ a(x *) + 0(1). (46) 

While negligible with respect to X(x — #*), the term containing a(£*) seems 
to be essential in determining a precise asymptotic expansion of w or u = w x 
as x — > oo. The value a(x*) is the thrust exerted by the fluid across the vertical 
section of the Hele-Shaw channel exactly at the nose of the finger. 

The asymptotic expansion (46) enables one to compute a(x*) in the case of 
the Saffman-Taylor solutions (38) and (40). For the symmetric fingers (38), one 
computes 

4U 

n(0) sy m = — A(1 — A) In 2 ( since x* = 0). (47) 

7 r 

Such an area is maximized for A = For non-symmetric fingers, the computation 
is better carried out in terms of the parameter y Q introduced in (39), and it gives 

tt(x*)non-sym = a(0) sym {l + f(X,y o )}, (48) 

where 

(ln4)/(A,t/ 0 ) = ln|(l + ^_) (l - y^) j- (48') 

For a fixed value of A G (0,1), i.e., for a fixed asymptotic width of the finger, 
regardless of its geometry, the function y Q — > f(X,y 0 ) is even, it vanishes at y Q = 0 
and is positive otherwise. Therefore it has a minimum at y Q = 0, i.e., for a = —/3. 
This implies that the corresponding finger is symmetric. 

Thus, among all the non-symmetric fingers of given asymptotic width, the 
symmetric ones are those with the minimal area. On the other hand, among all the 
symmetric fingers, the one of largest area occurs for A = f . The notion of area func- 
tion for symmetric fingers and its extremal property for A = \ had already been 
noticed by Saffman and Taylor in [28]. However, no connection had been drawn 
with the asymptotic behaviour of solutions nor with its physical interpretation as 
the thrust of the fluid at the nose of the finger. 6 



10. Motion by Mean Curvature as a Selection Criterion 

Let {T t } be a family of planar, regular curves parameterized by the time variable, 
say for example X — X{y, t) for y £ (y*, y* + A), for some y* E 1R and A > 0. The 



6 We would like to thank B. Matkowsky for pointing out reference [28] to us. 
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family {T t } moves by mean curvature if there exists a constant k such that 



X t = k 



Xyy 

l + Xl’ 



within its domain of definition. 



(49) 



If the family is of the travelling wave type, i.e., {X — x(y) + Ut} for some given 
constant U, then (49) takes the form 



d U 

— arctanx = 7 , where 7 = — . 
dy k 



By integration, 



c(y) = / tan{ 7 ( 77 - y„) + (*)dr] + x 

dy* 

|cos{7(y-2/*) +C*) 



= In 

7 



cos (* 



(50) 



+ X* 



where x* = x(?/*) and £* = arctan{x'(?/*)}. If any of the curves in (50) has to 
represent a finger for (36)-(37) with nose at (x*,?/*), we must have x'(y*) = 0 
and x'(y) — > —00 as y — ► y* + A. This implies that C* = 0 and 7 A = — 7 r/ 2 , i.e. 
k = —2 U\/tt. Thus, a curve of the sort of (50) moving by mean curvature would 
represent a finger for (36)-(37) only if it has the form, 



x(y) 



2X 



7T 



In 



cos 2\(y — y*) 



+ X* 



In particular x(y) is symmetric about y — y*. This implies that none of the non- 
symmetric Saffman-Taylor solution (40) represents a motion by mean curvature. 
Up to a suitable change of coordinates we take (x*, 2 /*) = (0,0). This gives 



2 A 7 r 

x(y) = — In cos — y ye(-X,X). 

7 r Z A 

Among the symmetric curves in (38), only the one corresponding to A = \ moves 
by mean curvature, since 

x ( ») = i^ 1 „{i( 1 +c o S ^)} 

From (49) it follows that the family of curves {x = x(y) + Ut} moves by mean 
curvature if 

Rcos0 = k/U, (51) 

where R is the radius of curvature to the curve T 0 and 0 is the angle between the 
normal to r o and the positive direction of the x-axis. Requiring a curve to satisfy 
(51) is equivalent to imposing that it moves by mean curvature. 

It was observed by Pitts [25] as a reading of experimental data that condition 
(51) forces a curve to be a finger in the setting described in § 3.3. Pitts, however, 
does not seem to have connected it to a motion by mean curvature. 

It is observed that for small values of £7, curves satisfying (51) fit the ex- 
perimental curves better than those of Saffmann-Taylor ([25] and [22] pages 462 
and 465). In fact, for small J7, the Saffman-Taylor solutions do not agree with the 



2A 7 r 

= — In cos —y. 
1 7 r Z A 
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experimental data ([29] page 326). This seems to suggest a selection mechanism 
of solutions based on motion by mean curvature of the free boundary. 



10.1. A Further Remark on the Motion by Mean Curvature 

While supported by the experimental data of Pitts, such a mechanism needs to 
be mathematically better understood. Indeed all the symmetric Saffman-Taylor 
curves (38) move by mean curvature in the following sense. Consider the family of 
travelling wave curves 

X x(y,t) = ^^ x (y) + Ui A €(0,1) 



where x = x(y) is the symmetric Saffman-Taylor solution (38). A straightforward 
computation that uses the explicit form of (38) gives 



X xt = k 






U 

k 



A 

1 - \ Xyy 



1 + 



A 



1 - A 



2 



7 r 

2A’ 



Note that, for any given A E (0, 1), this is precisely the condition for X\{y,t) to 
move by mean curvature. 
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Finite Element Methods for Surface Diffusion 

Eberhard Bansch, Pedro Morin, and Ricardo H. Nochetto 



Abstract. Surface diffusion is a (4th order highly nonlinear) geometric driven 
motion of a surface with normal velocity proportional to the surface Lapla- 
cian of the mean curvature. We present a novel variational formulation for 
the parametric case, develop a finite element method, and propose a Schur 
complement approach to solve the resulting linear systems. We also intro- 
duce a new graph formulation and state an optimal a priori error estimate. 
We conclude with several significant simulations, some with pinch-off in finite 
time. 

Keywords: Surface diffusion, fourth-order parabolic problem, finite elements, 
a priori error estimates, Schur complement, smoothing effect, pinch-off. 

AMS subject classification: 35K55, 65M12, 65M15, 65M60, 65Z05. 



1. Surface Diffusion and Mixed Formulation 

The overall goal of this work is to devise efficient numerical tools for simulating 
morphological changes in stressed epitaxial films and thereby study their compli- 
cated nonlinear dynamics. To model the misfit between the crystalline structure 
of the substrate and the epitaxial film, the film may be thought of as subject to 
mechanical stresses. This causes a plastic deformation of the free surface of the 
film. This morphological instability of the free surface may eventually lead to crack 
formation and fracture, an issue of paramount importance in Materials Science; 
see for instance [1, 4, 14] and also the list of references in [2]. 

The dynamics of the free surface T in R. d is governed by 

V = -A s (k + 6), (1) 

which is a 4th order highly nonlinear PDE. Hereafter, d € {2, 3}, V and k are the 
(scalar) normal velocity and mean curvature of T, respectively, Vs is the surface 
gradient, A^ = divs\fc the Laplace-Beltrami operator and e is the elastic energy 
density of the bulk ft(t) enclosed by T(t). We will take e = 0 throughout this 
paper. 

A number of issues arise, going from existence, well posedness, and regularity 
to effective algorithms for (1). The chief mathematical and numerical difficulties 
stem from the 4th order nonlinear operator A s& and the fact that one cannot 
work directly with the curvature vector R as in [8]. In contrast to [5, 10], we have 
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been able to derive from a suitable semi-implicit time discretization a variational 
finite element formulation for surfaces in which involves the four unknowns, 
namely the scalar curvature k, the curvature vector k, the normal velocity V , and 
the (scalar) normal velocity V, as follows: 

k = AsX, k, = k, • V, V = —A sk, V = Vv, (2) 

where V denotes the unit normal vector to T, pointing outward of the bulk enclosed 
by r. In view of this mixed formulation, which features only second order operators, 
our finite element method solely requires continuity of the discrete functions; in 
particular we do not need C 1 elements to handle curvature but can accommodate 
any polynomial degree. A Schur complement approach, with a symmetric and 
positive definite operator, is used to reduce the system (2) to the single unknown 
V. This allows for an efficient solution technique via preconditioned CG. 

This paper is organized as follows. We introduce the time discretization and 
natural variational formulation in §2, and present its finite element discretization 
in §3. We discuss the ensuing algebraic problem along with a Schur complement 
approach to its solution in §4. We present in §5 the graph formulation together 
with an error estimate. We document the performance of parametric and graph 
methods in §6 via several simulations which exhibit singularities in finite time. 



2. Time Discretization and Variational Formulation 

We now consider a semi-implicit time discretization of first order with time-step r, 
representing the next surface T n+1 in terms of the current surface T n as follows: 

X n+l = X n + tV. (3) 

This time discretization implies that all geometric quantities such as V are evalu- 
ated on the current surface T n . Consequently, letting As denote now the Laplace- 
Beltrami operator on T = T n , the only modification to (2) is as follows: 

k-tA s V = A s X n . (4) 

To derive a weak formulation to (2) and (4), we simply multiply them by 
test functions and integrate. If V(T) := iJ^T) and A(T) is the subspace of V(T) 
of functions with mean value zero, we thus seek V E V(T),k E V(T),/5 E X(T), 
and V E A(T) such that 

(ZM + t(VsV,V'p) = -(v s X,Vs'p) v^e^(r), (5) 

■»,<!>) = o (6) 

(v, </>) - <v S K, Vs<^>) = o v 4 > G *(r), (7) 

(v,<p)-(v,<p-V) = 0 v^GV(r). (8) 

Hereafter the symbol (•, •) stands for the L? scalar product over the current surface 
r = r n , which is described by the vector function X — X n ; we suppress the 
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superscripts n and n + 1 since no confusion arises. Since V G <T(r) has mean value 
zero, we realize that volume is preserved in the sense that f r V = 0. 



3. Finite Element Discretization 

Let T be a regular but possibly graded mesh of triangular finite elements over the 
surface T, which is assumed to be polyhedral from now on. Let h denote the local 
meshsize of T. Let T G T be a typical triangle and let Vt = t> e un ^ 

normal to T pointing outwards. We denote by V the outward unit normal to T, 
which satisfies V\ T — Vt for all TgT, and is thus discontinuous across dT. 

Let {(f)i}{ =l be the set of canonical basis functions of the finite element space 
Vh(r) of piecewise linear functions over T; this is a conforming approximation of 
V(r). If Id G R dxd is the identity matrix, consider the following matrix entries 



Mij • ? 


Aij * — i 


(9) 


Mij := Mijld, 


:= {<t>rA 3 v k ) d k=l , 


(10) 


and corresponding mass and stiffness matrices 




M := M := 




(11) 




A := (Ai^i j = i- 


(12) 



We point out that M,A and N possess matrix- valued entries and therefore the 
matrix- vector product is understood in the following sense 



/ j 

3 = 1 

because each component of V = (Vi)( =1 , as well as of MV, is itself a vector in 
R d . 

The vector of nodal values of a finite element function is written in bold 
face, namely V = (Vi)f =1 G ¥ := IR 7 is equivalent to V = J2i=i £ V/*(r). 
Let X h (T) be the subspace of V/*(r) of functions with mean value zero, and let 
X = {V G V : V -Ml = 0} be the corresponding subspace of V with 1 := (1 )f =1 , 
so that 

i 

V = v = (V$)f=i e X. (13) 

2=1 

Upon expanding the functions V, ft, V, R in terms of the basis functions 
{<t>i}{ = i and testing against the latter ones, a discretization of system (5)-(8) can 
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be written in matrix form as follows: find V G V, kG V, KeX, V GX such that 



~tA 0 M O' 




V 




~-AX 


0-4 0 M 




K 




0 


MO 0 -N 




K 




0 


1 

0 

1 

0 

1 




v_ 




0 



We discuss the solvability of (14) and propose an algorithm for its solution in §4. 



4. Schur Complement Approach 

Consider the following vector equation with a (possibly singular) square block A: 



'A 


B 


U 




F 


C 


D 


Q. 




G 



Let A be symmetric with (nontrivial) kernel ker(A). Then the range Y of A is 
the orthogonal complement of ker(A). Let S : Y — > Y be the right inverse of A: 
AS = Id on Y. If P denotes the orthogonal projection onto ker(A), we have 

SAV = V- PV = (Id - P)V V V G M 7 . (15) 

The Schur complement equation for Q then reads 

(-CSB + D)Q + CPU = G- CSF. (16) 

The solvability of this system depends on the structure of the two terms on the 
left hand-side of (16). We intend to apply this splitting to (14), which involves 
dealing with the block containing A and A on the diagonal. 

Since the kernel of A in (12) is Z = span{l}, with 1 = (1) 7 =1 , the range of 
A is Y = Z- 1 . Then the spaces X, defined in (13), and Y are related by 

VGX MV E Y. (17) 

Let S : Y — » Y be the right inverse of A and let P : Y — * Z be the orthogonal 
projection into Z, thereby satisfying (15) with P = 

Applying (16) to (14) with vectors U = [V,K] T and Q = [K^V] T , we 
obtain the symmetric system 



amsm 


N 


K 




'-\MSAX + MPV 


_ T n t 


-MSM_ 


V 




MPK 



We observe that SAX makes sense because AX E Y; this could be viewed as a 
compatibility condition. To investigate the solvability of (18), we note that both 
components of Q satisfy K G X and V G X or, in view of (17), MK E Y, and 
MV E Y. Since the upper left block of (18), MSM : X -* MY, is nonsingular 
with inverse M~ l AM~ 1 , we can apply (16) again to get 

(tN t M _1 AM -1 N + MSM^jV + MPK = -N T M~ l AX. (19) 
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Let n := Id — °rthogonal projection onto X. Since MPK G 

span {Ml} = X 1 , applying II to (19) yields the final form of the Schur comple- 
ment: 

u(tN t M- 1 AM~ 1 N + MSM^jUV = -U^'M^AX, (20) 

because II V = V. The ensuing matrix of this reduced system is nonsingular be- 
cause HMSMH : X — ► X is symmetric and positive definite, and the first term in 
(20) is symmetric and positive semi-definite. 

The method actually implemented consists of solving first for V using (20), 
next solving MV = NV for V and finally updating X via X + rV. 



5. Graph Formulation and Error Analysis 

Let us now discuss the case of T being a graph. To this end, let Q C IR d ~ l , 
d G {2,3}, and T(t) := {(x,u(t,x)) \ x G ft} C lR d , with t G [0, T] for some T > 0. 
If Q(u) := y/l + |Vu| 2 denotes the elementary surface area, then we have the 
following formulas for the geometric quantities z/, k and V : 






Q(u) 



V ■■ 



d t u 

Q(u )’ 



( 21 ) 



with V = (d Xl , . . . , d Xd _ 1 ) T . According to (21), (1) can be written as the system 



-v.q. , 22) 

This system has to be supplied with suitable initial and boundary conditions. We 
restrict to periodic boundary conditions for ease of presentation, but Neumann 
and Dirichlet boundary conditions can be handled as well, see [2]. In this vein, let 
Q be a parallelogram and X be the subspace of H 1 (fi) with periodic boundary 
values. Then, (22) admits the following variational formulation: find u and k, with 
u(t), n{t) G X for a.e. t and iz(0) = uq in a suitable sense fulfilling 

(d t u, 4>) - J Vs AC • VsV’ = o V $ g x, (23) 

<«» <P) + [ = 0 v <P e ( 24 ) 

Jq Q\ u ) 

where (*, •) denotes the L 2 (Q) inner product; compare with §2. Note that, in con- 
trast to the mean curvature flow of graphs [6, 7], (23) does not have the weight 
Q{u)~ x because the equation is written on T. 

If Xh is a finite element subspace of X , then a semi space discrete scheme 
is obvious from (23) and (24). In particular, upon taking ijj = 1 G Xh we deduce 
the exact volume conservation Uh = 0. We can also prove stability as well as 
coercivity properties, which are crucial in deriving the following error estimate [2]. 
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Theorem 5.1 (A Priori Error Estimate for the Semidiscrete Scheme). Let u,n be 

sufficiently smooth in [0 ,T\, and let k > 1 be the polynomial degree of Xh- Then, 
there exists a constant C > 0, depending on the regularity of u, k and T only, such 



that 



sup (\\{u 

t€[0,T] 

fT ,„. 

H — 



~ u h)(t)f L 2 {Q) + f |V S W - VsUft)! 2 ) < cti 
J r h (t) ' 

K h){t)\\ 2 L 2 (Q) + [ |V s k - Vg^l 2 ) dt <Ch 
Jr k (t) J 



(25) 



Since (25) involves the tangential gradients Vsu and Vsk, the order of convergence 
0(h k ) is optimal. This is corroborated by the numerical experiments of [2], which 
are obtained via a semi-implicit time discretization similar to [7]. It consists in 
writing the equations for step n + 1 on the current surface T n , which linearizes 
the system and allows for a Schur complement solver similar to that in §4. 



6. Implementation and Simulations 

In this section we explain briefly the implementation of both the parametric and 
graph formulations within ALBERT [12, 13], and document their performance. 

6.1. Mesh Regularization 




FIGURE 1 . Pathological ear formation in the evolution of a 4 x 1 x 1 
prism by surface diffusion. Surface at time t = 0 and t = 0.07. 



The geometric flow by parametric surface diffusion is not as gentle as the corre- 
sponding mean curvature flow [8], and leads to severe mesh distortions. Even if the 
parametric formulation of §3 allows corners and edges, which are rather singular 
for surface diffusion, they give rise to fast node motion and mesh distortion. This 
is illustrated by the creation of ears during the evolution of a 4 x 1 x 1 prism 
in Figs. 1 and 2 towards a ball. This is clearly a numerical artifact and cannot 
be cured by mesh refinement and/or coarsening. We use mesh regularization in- 
stead, which consists in replacing each node by the projection onto the surface of 
a weighted average of the nodes belonging to its finite element star, see also [11]. 




FIGURE 2. Steps towards the pathological formation of ears. Zoom 
into a vertex of the initial prism with surfaces at t — k x 10~ 3 for 
0 < k < 9. After 6 time steps some triangles collapse into segments, 
thereby making the mesh degenerate and producing numerical artifacts. 



This regularization is equivalent to one time step of the discretized (tangential) 
heat equation. Its beneficial effect is is displayed by the following simulations of 
Figs. 3-6, all computed with forcing term e = 0. 

6.2. Example 1: Smoothing Effect 

We illustrate the evolution of a 4 x 1 x 1 prism towards a ball with the same volume 
in Fig. 3, thereby showing the smoothing effect of surface diffusion. 




FIGURE 3. Smoothing effect of surface diffusion. Evolution of a 4 x 
lxl prism towards a ball with equal volume at various time instants. 



6.3. Example 2: Pinch-Off in Finite Time 

Surface diffusion is not always a regularizing flow and may in fact create singulari- 
ties in finite time depending on the initial configuration. This is shown in Figs. 4-6 
which correspond to prisms with larger aspect ratios than before. Figs. 4-5 display 
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FIGURE 4. Pinch-off in finite time. Evolution of an 8 x 1 x 1 prism at 
various time instants leading to a dumbbell and cusp formation. 



the evolution of an 8 x 1 x 1 prism towards a dumbbell and cusp formation in finite 
time. Increasing the aspect ratio of the prism seems to be an effective mechanism 
to produce several simultaneous cusps, as reported in Fig. 6. 




FIGURE 5. Detailed view of the pinch-off for the 8x1x1 prism. 
Mesh regularization appears to cure mesh distortion until the moment 
of pinch-off when the elements are rather elongated but not degenerate. 



6.4. Example 3: Mushroom Formation 

We consider the graph formulation of §5 of a curve in 2d starting from the initial 
condition uq(x) — 1 + S(x), where 5 is a steep perturbation with zero meanvalue 
(see Figure 7). Since the slope seems to become vertical around t = 4.8 x 10 -5 in 
Fig. 7, the classical solution might cease to exist in finite time. 

To investigate the formation of singularities in finite time, we use the para- 
metric formulation with the same initial data, upon embedding the graph of uq 
into a closed curve (see Figure 8 top left). For the time scale of Fig. 7, the effect of 
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FIGURE 6. Evolution of a 12 x 1 x 1 prism towards two simultaneous 
cusps revealing that the number of singularities depends on the aspect 
ratio of the initial prism. 



this extension is negligible. Fig. 8 displays a sequence of solutions obtained for the 
same eight time instants of Fig. 7. It is worth noticing the striking similarity of 
the solutions obtained with both methods. Even though the parametric solution 
develops a mushroom shape at t = 9.6 x 10 -5 , and thus the solution to the graph 
formulation is questionable thereon, they still exhibit an excellent quantitative 
agreement for t > 9.6 x 10 -5 (compare the last two plots of Figs. 8 and 7). 



T 



t = 8 x 10“ 6 






t = 4.8 x 1(T 5 t = 9.6 x 10- 5 t = 19.2 x 1(T 5 t = 38.4 x 1(T 5 

FIGURE 7. Evolution of a graph in 2d starting from uo(x) = 1 + 5(x) 
with a steep perturbation S(x). In all plots for various times t, the 
x-axis ranges from —1 to 1, and the y - axis ranges from 0 to 1.5. 
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FIGURE 8. Mushroom formation. Parametric evolution of a curve in 
2d with the same initial condition and the same times of Fig. 7. The 
rectangles in thin lines are [—1,1] x [0,1.5]. The effect of embedding 
the graph of Fig. 7 into a close curved (see top-left picture) is negligible 
for the time scale of this evolution. 
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Crystal Growth and Impingement in 
Polymer Melts 

Martin Burger 



1. Introduction 

Crystallization of polymeric materials is a solidification process in strong interac- 
tion with heat conduction. Both the basic mechanisms involved in the solidification 
from a melt, namely the nucleation and growth of crystals, are strongly influenced 
by the temperature and its variation. Vice versa, the latent heat is rather large for 
polymeric materials, so that it causes a considerable change in the heat transfer 
process. 

The interaction of solidification and heat conduction is a well-known phenom- 
enon in models of phase-change, and involves important problems in the theory 
of free and moving boundary problems. In typical examples of moving boundary 
problems for the heat equation such as the well-studied Stefan problem (cf. [18]), 
the unknown boundary is assumed to be an isotherm, which leads in a mathe- 
matical model to a homogeneous Dirichlet condition for the (appropriately scaled) 
temperature. For polymers, the situation is different, since the phase change does 
not take place at a fixed temperature (or with kinetic undercooling close to this 
temperature) , but in a rather large temperature range between the thermal melting 
point T m and the glass transition temperature T g . 

The nucleation of new crystals can be modeled as a heterogeneous Poisson 
process in space and time, with rate a = a(x,t). For many important materials 
such as isotactic Polypropylene (i-pp), a model for the nucleation rate in depen- 
dence of the temperature u is given by a(x,t) = ^N(u(x,t)) (cf. [6]), with some 
material function TV, which varies only in the interval (T p ,T m ), i.e., no crystals 
nucleate if the temperature is below the glass transition temperature or above the 
melting point. Usually it is assumed that new crystals nucleate as small spherical 
objects with initial radius Ro determined from classical nucleation theory. The nu- 
cleation process has been well investigated with respect to its stochastic properties 
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Figure 1. Impingement of polymer crystals (right) and a final 
tesselation from an experiment with isotactic Polypropylene. 



(cf. [9, 20]), its averaging (cf. [5, 8]), its numerical simulation (cf. [19]), as well as 
the determination of the nucleation rate from indirect measurements (cf. [2, 7]). 

The growth of the crystalline phase 0 = IJili (0i(£) being the single 
crystals), is determined via a nonlinear Gibb s-Thomp son relation for the velocity 
Vn and the temperature u, 



V N = G(u) 



du 

dn 



on T := dO - dtl, 



(i.i) 



where G is a given material function, L is the latent heat, and [J^] denotes the 
jump of the normal derivative on the boundary. For a certain temperature interval 
(e.g. for u > 80° C in the case of isotactic Polypropylene) the function G is nonin- 
creasing, so that the relation can be rewritten as u = G _1 (V/v), which is a nonlin- 
ear version of the typical condition used for kinetic undercooling. Surprisingly, the 
function G usually reaches a maximum at some temperature (depending on the ma- 
terial) and is non-decreasing below it, which is confirmed by some experiments (cf. 
[21, 25]). The experimental background for this growth model in a nonisothermal 
situation was provided by Schulze and Naujeck [26], who considered the growth of 
a crystal in an environment with strong temperature gradients and found that the 
obtained shapes coincided with the ones predicted by a minimal-time principle. In 
[8], the growth model was reformulated in arbitrary spatial dimension as (1.1) (cf. 
[28] for an overview of different formulations of crystal growth models ). Motivated 
by the physics of the problem we shall assume that G is a smooth function with 
compact support, which is nonnegative and globally bounded. 

The heat transfer in the material is described by the heat equation in ft — 90, 
in general with parameters (such as conductivity, density, and capacity) depending 
on the phase. For simplicity we restrict our attention to the case of all parame- 
ters being constant, which yields, together with the second identity in (1.1), the 
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nonlinear parabolic equation 
f) u 

c— — kAu = LSrG(u) in Q x (0 , T), (1.2) 

where Jr denotes the Dirac delta distribution on T. This equation is supplemented 
with appropriate initial conditions at time t = 0, i.e., 

u = uo in Q x {0}, (1.3) 

and with boundary conditions on dft x (0,T); for simplicity we restrict our atten- 
tion to Dirichlet conditions of the form 

u = ud oncftlx(0,T). (1.4) 

We finally note that (1.2) has an equivalent enthalpy formulation of the form 
de 

— — kAu = 0. e — cu — Lx , (1.5) 

with x being the indicator function of D. 

In the following we shall discuss the mathematical analysis and numerical 
simulation of the initial-boundary value problem (1.2), (1.3), (1.4), coupled to 
the growth law (1.1). We assume that the nucleation events {(Xj,Tj)} J= 
representing the origin and time of nucleations, as well as the initial shape of a 
nucleated crystal (a small spherulite), are given a-priori, analysis and simulation 
of the stochastic aspects of nucleation shall be discussed elsewhere. 

2. Mathematical Analysis 

In the following we shall be concerned with the mathematical analysis of (1.1) 
coupled to the heat transfer problem (1.2), (1.3), (1.4). The analysis of this problem 
is challenging in the single crystal case already, where only local-in-time existence 
and uniqueness could be proven so far by Friedman and Velazquez [11] using fixed- 
point arguments and regularity (of the temperature and the free boundary) in a 
Holder class C 2+c \ a > 0. 

In presence of multiple crystals, the mathematical analysis of the crystal 
growth via (1.1) leads to additional difficulties: 

1. Because of the non-equilibrium condition at the interface, there is no weak 
formulation of the problem (such as for the two-phase Stefan problem, cf. 
[18]) that would circumvent the problem of dealing with the interface 
explicitly. 

2. Even before impingement the growth of a fixed crystal is influenced by 
all the other crystals via changes in the temperature, i.e., we have to deal 
with a nonlocal front growth problem. 

3. One cannot expect much regularity of the free boundary T even if all 
parameters and initial shapes are analytic. This can be seen from a sim- 
ple case with G = 1 and two spherulitic crystals. Before impingement, 
the shape T(t) is analytic, but the impingement creates a non-Lipschitz 
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singularity. For each time after the impingement event, the shape T(t) is 
only locally Lipschitz. Thus, the best type of regularity we can expect is 
“locally Lipschitz almost everywhere”. 

A detailed analysis can be carried out in the spatially one-dimensional case, 
where regularity of the free boundary plays no role since all crystals are com- 
pact intervals and T{t) consists of a finite number of points. Global existence and 
uniqueness for this case has been obtained recently (cf. [3]); here we only state the 
main result: 

Theorem 1. Let Cl C R 1 be a bounded interval, let T E R + be arbitrary and let 
( Xj,Tj ) E Cl x [0, T], j = 1, . . . , M be given nucleation events. Then the coupled 
problem (1.1), (1.2), (1.3), (1.4) has a unique weak solution 

u E L°°(0,T; C 0,1 (fl)) and T E L°°([0, T]; /C M ), 

where Km is the set of all subsets of Cl with cardinality less or equal to 2 M equipped 
with the H aus dor ff metric. Moreover, for each time interval [Si, 62] that does not 
contain a nucleation or impingement event, we have 

ueC(S u S 2 ;C°’ 1 (n)) and r e C' 1 ([0,T];/C m ). 

In multi-dimensional situations, a promising approach is the representation 
of the growing crystalline phase via the level set approach, i.e., 

0(£) = { x E ft | <t>(x,t) < 0 }, (2.1) 

where 0 is a continuous functions. A weak formulation of the evolution of Cl(t) and 
T (t) = dCl(t) via (1.1) is provided by computing a viscosity solution of the first- 
order Hamilton- Jacobi equation (cf. e.g. [10, 17] for details on viscosity solutions) 

^ + V\V<p\ = 0, in R d x (0, T), (2.2) 

where V is the extension of the velocity Vn to the whole space H d and the initial 
value 0(.,O) is a uniformly continuous function satisfying (2.1) for t = 0. This 
approach, originally introduced for numerical purposes by Osher and Sethian [23], 
does not rely on any parametrization and allows to treat topological changes in 
an automatic way. We refer to the monographs by Osher and Fedkiw [22] and by 
Sethian [27] for an overview of the level set method and its applications. 

Nucleation events can be included by restarting the level set evolution at each 
nucleation time with a new level set function including the nucleated object, and 
therefore we shall restrict our attention to a time interval (0, T) without nucleation 
in the sequel, assuming that some initial shape 0(0) is given. If we assume that a 
temperature u E C(0, T; C 0,1 (O)) is given and if the domain Cl has a sufficiently 
smooth boundary, we can find some extension u on H N x (0,T) such that the 
(natural) extension velocity defined by V = G(u) is bounded, nonnegative, and 
Lipschitz continuous. For such an evolution we can collect the following results: 
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• Well-posedness: For each uniformly continuous initial value, the level set 
equation (2.2) has a uniformly continuous unique viscosity solution (cf. 
[10]). 

• Stable dependence: The solutions of the level set equation depend in a 
Lipschitz-continuous way (in the norm of C(0, T; C 0,1 (Q))) on the velocity 
y (cf. [12]). 

• Non- fattening: If we start with a level set function such that T(0) = {xe 

| (j){x, 0) = 0 } has zero Lebesgue measure, then, for all t > 0, the set 
r (t) coincides with the zero level set of </>(.,£), which has zero Lebesgue 
measure (cf. [1]). A recent result by Ley [14] shows that even an estimate 
of the form |V0| > rj > 0 holds in the viscosity sense in a neighborhood 
of T(t). 

• Partial regularity: If we assume in addition that V > p in £1 x (0, T) for 
some p G R + (which is the case in the temperature range of interest), 
then for almost every t E (0,T), the set T(t) is locally Lipschitz almost 
everywhere with respect to the l~L d ~ x -measure (cf. [4]). 

Hence, by the level set approach we obtain a well-posed evolution, which yields 
exactly the maximal regularity we expected. 

An interesting open problem for future research is the coupling of the level set 
evolution with the heat transfer model (1.2)-(1.4). A promising approach to global 
existence seems to be the use of a relaxation model as for the Stefan problem (cf. 
e.g. [29]) and (or) some smoothing of the form y e = / e (<^), with f e approximating 

( 1 if s < 0 

f(s) = < [0, 1] if s = 0 . 

[ 0 if s > 0 

For the smoothed problem, the map </> i— ► u is continuous and compact, which 
allows, together with the above stable dependence results for the solutions of the 
level set equation, to use a fixed point argument to deduce the existence of a 
solution. Moreover, analogous a-priori estimates as in [29] can be obtained for 
the temperature u in the norm of L 1 (0,T; VF 2,1 (£1)), which might allow to deduce 
existence by similar weak-star convergence techniques as in [29], 

3. Numerical Simulation 

The numerical simulation of crystal growth consists of two main tasks, namely the 
numerical solution of the level set equation (2.2) and of the heat transfer problem 
(1.2)-(1.4). In addition, we have to implement the coupling of these problems via 
the velocity V = G(u) and the source term in (1.2). 

For solving first-order Hamilton- Jacobi equations like (2.2), several schemes 
have been proposed over the last two decades, most of them using finite differ- 
ences. A frequently used class of solvers are essentially nonos dilatory schemes (cf. 
[24]), originally developed for conservation laws. This type of schemes allows high 
order approximation of solution, can be implemented efficiently, and is explicit 
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in time (like the majority of schemes for conservation laws and Hamilton- Jacobi 
equations). As any other explicit scheme, there is a stability bound on the time 
step r in dependence of the spatial grid size h, namely the so-called CFL- condition 

r sup \V(x,t)\ < h. (3.1) 

X 

Since the velocity V — G(u) is usually rather small for polymeric materials, this 
stability bound is not restrictive and the actually chosen time step is usually far 
below the one allowed by (3.1). 

Another consequence of an explicit time scheme for the level set equation is 
the possibility to use its result as an input for the time step in the heat equa- 
tion, which should be implicit, since the stability bound for a parabolic equation 
(t ~ h 2 ) is quite restrictive. Thus, having computed </> at time t + r, we can apply 
the (horizontal) method of lines to the heat equation (1.2) or (1.5) and use the 
the values of 0 to assemble the right-hand side, leading to an elliptic boundary 
value problem in each time step. For the spatial discretization of the arising elliptic 
problem, we have several choices. A first possibility is to choose a finite element 
grid matching the interface, which leads to high accuracy but also to an enormous 
numerical effort, since the grid has to be changed in each time step. More efficient 
are methods with non-matching grids such as the immersed interface method (cf. 
[15, 16]), which is a finite difference method on a rectangular grid with additional 
correction terms in those cells including parts of the interface, or a method ap- 
proximating the interface terms by smoothed quantities, which is frequently used 
in the context of level set methods (cf. [22]). For the latter, the simplest possi- 
bility consists in using the enthalpy formulation (1.5) and approximating x by 
X e — f e {4 > )i with f e as in the previous section, which yields an equation of the 
form 

c ~ - kA-u= -LS € (4>)~, (3.2) 

with S e approximating the Dirac delta distribution. After time discretization, we 
can employ a finite element method on a mesh fixed during the evolution or at 
least during several time steps. The mesh generation should be fine around the 
initial interface and consequently also around the interface after some time steps, 
provided the motion is not too fast. Only if the mesh around the interface becomes 
too coarse after some time steps, the domain has to be remeshed. In this way, 
the numerical effort can be reduced significantly. For computations with a large 
number of crystals (see [19] for such simulations), it seems of advantage to use a 
fine uniform grid (fixed during the evolution), and to apply fast Poisson solvers 
for the solution of the elliptic problem. 

Finally, we show a numerical experiment with two crystals in a domain scaled 
to the cell [— 1,1] 2 . We use material parameters in a range typical for isotactic 
Polypropylene (cf. [5] for numerical values), and use a temperature scaling such 
that L — 1 (around 70° unsealed). The (unsealed) initial value is a uniform tem- 
perature uq = 124° C. The heat transfer parameters, which are typically small for 
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polymers, are chosen as c = k = 10 -2 . All solvers were implemented in the soft- 
ware system MATLAB, for the heat equation we use a finite element discretization 
with n = 19968 triangles. The level set equation is solved on a uniform grid with 
256 x 256 elements using a weighted ENO-scheme of fifth order in space and third 
order in time, with a Godunov flux, as proposed in [13]. 

In the first example, shown in Figure 2, we use a uniform Dirichlet value of 
u — 88° C at the boundary. In this case, one can observe a strong reheating effect 
due to the release of latent heat at the crystal boundary and the free boundary 
is almost an isotherm of the temperature. Moreover, the absence of strong tem- 
perature gradients in the crystals causes the shape to stay close to a spherical 
shape. Consequently, the crystals heat each other at a single point and the contact 
interface propagates outward from this point. 

The behavior in the second example (shown in Figure 3) is different due 
to different boundary values of the temperature, which are very low on the left 
and upper part of the rectangle (around 70° C) and rather high at the right and 
lower part (around 140 °C). This yields a temperature gradient mainly in diagonal 
direction, with some changes due to reheating. In particular, the temperature 
around the lower crystal is so high (and the value of G(u) so low) that there is 
almost no growth, while the second crystal grows to a large, non-spherical shape. 
The impingement starts from two different contact points in this case and leaves 
a small liquid part inside the crystalline phase for some time. 
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Figure 2. Color plot of the temperature and shape of the grow- 
ing crystals in the first example: at the initial stage (t = 1.09, 
t = 4.72), short time before (t = 12.9), at (t = 15.1) and after 
impingement (t = 15. 8 , t — 19.1). 









Figure 3. Color plot of the temperature and shape of the grow- 
ing crystals in the second example: at the initial stage (t = 19.7, 
t — 34.7), short time before (t = 88.7), at (t = 93.8) and after 
impingement (t = 97.3, t = 100). 
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Moving Bands and Moving Boundaries in 
an Hybrid Model for the Crystallization 
of Polymers 

Vincenzo Capasso, Ramon Escobedo, and Claudia Salani 



Abstract. Crystallization is a mechanism of phase change in polymeric mate- 
rials; it consists of at least two processes, a nucleation (birth) process describ- 
ing the time and the location of spots in the material where crystals start 
growing, and a growth process of the nucleated crystals. 

Both processes are coupled to temperature, and viceversa the tempera- 
ture field is coupled with the crystallization process due to the production of 
latent heat. 

All the processes described above are of a random nature, since the 
birth process and the consequent geometry of the crystalline phase are ran- 
dom. However, under typical industrial conditions (many and small crystals), 
a multiple scale assumption can be made so that a deterministic approxima- 
tion (homogeneization) for the spatial density of the crystalline phase at the 
(macro) scale of the temperature field is possible. 

Thus, at the macroscale the whole crystallization process may be mo- 
delled as a reaction-diffusion system in which a deterministic ODE for the 
crystalline density is coupled with a deterministic PDE for the temperature 
field via the kinetic parameters of birth and growth on the one hand and the 
latent heat on the other hand. 

Numerical simulations of this reaction-diffusion system show that the 
solution exhibits an advancing moving band of crystallization in the mass 
distribution, accompanied by a moving boundary in the temperature field, 
both advancing with a decreasing velocity. 



1. Introduction 

Crystallization is a mechanism of phase change in polymeric materials; as for 
various similar processes, it consists of at least two processes, a nucleation (birth) 
process describing the time and the location of spots in the material where crystals 
start growing, and a growth process of the nucleated crystals. 

Both processes are coupled to temperature, since in general the kinetic pa- 
rameters of birth and of growth depend significantly upon the local temperature 
field, and viceversa the temperature field is coupled with the crystallization pro- 
cess due to the production of latent heat, and the possible dependence of the 
parameters of the heat conduction upon the phase (liquid or solid) [1]. 




76 



V. Capasso, R. Escobedo, and C. Salani 



Growth processes may be very complicated, but in the sequel the restriction 
to the case of spherulitic growth is made, which is also a good assumption for the 
crystallization of relaxed polymer melts. 

All the processes described above are of a random nature, since the birth 
process and the consequent geometry of the crystalline phase are random. However, 
under typical industrial conditions, a multiple scale assumption can be made, due 
to the fact that at the typical spatial scale of the temperature field (macroscale), 
many and small crystals are present, thus allowing a deterministic approximation 
(homogeneization) at the macroscale for the spatial density of the crystalline phase 
and consequently of the temperature field. This approach has been carried out in 
a rigorous way in [7] via suitable “laws of large numbers” . 

Thus, at the macroscale, the whole crystallization process may be modelled 
as a reaction-diffusion system in which a deterministic ODE for the crystalline 
density is coupled with a deterministic PDE for the temperature field via the 
kinetic parameters of birth and growth on the one hand and the latent heat on the 
other hand. In particular, we have introduced a critical temperature above which 
no nucleation and no growth can take place. Also, we have taken into account the 
possibility that the rate functions may present a jump at the critical temperature. 
Numerical simulations of this reaction-diffusion system show that the solution 
exhibits an advancing moving band of crystallization in the mass distribution, 
accompanied by a moving boundary in the temperature field, both advancing with 
a decreasing velocity. The fact that the speed is not constant, that is, that two 
different space segments of the same length have different times of crystallization, 
is a relevant aspect for the industrial process, and it is greatly interesting from the 
mathematical point of view. 

On the other hand, if we wish to obtain geometric information about the 
crystallization process such as the evolution equations for the interface densities, 
we may couple the averaged deterministic temperature field with the stochastic 
birth-and-growth process (hybrid model) (see [14] and Sec. 4). 



2. A rigorous geometric approach 

In this section we describe the crystallization process in a given deterministic 
field of temperature, neglecting at the moment the influence of the crystallization 
process on temperature, which we shall discuss later. 

Crystallization is a stochastic process in time and space due to the combi- 
nation of birth (nucleation) and growth of crystals. The first one is modelled as a 
marked point process, and the whole process is described by a dynamic Boolean 
model. The nucleation process is modelled as a stochastic spatially marked point 
process N on an underlying probability space (fl, A, P), with marks in the physical 
space E , a compact subset of R d , d — 1, 2, 3 (we shall denote by B ^ + , resp. £ , the 
cr-algebra of Borel sets on M + , resp. on E). The marked point process (MPP) N is 
a random measure given by N = e T n ,x ri where T n is an Revalued random 
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variable representing the time of birth of the n — th nucleus, X n is an EWalued 
random variable representing the spatial location of the nucleus born at time T n , 
and e t , x is the Dirac measure on x E. The crystalline phase at time t > 0 
is given by a random set 0* = [j 0* (see [ 1 ], Sec.5.1), which is the union of 

Tj<t 

all the crystals born at times Tj and locations Xj and freely grown up to time 
t. The integer valued random variable iV([0,£] x B) counts the (random) number 
of nucleation events occurred in the time-space region [0 ,£] x B G Sr + x £. We 
assume that the infinitesimal probability of birth of a new crystal during the time 
interval [t, t + dt], in the free spatial interval [x,x + dx] C E, is given by (see also 

[14]) 

uq ( dt x dx) = a(x,t) dx dt. 

The (degree of) crystallinity £(x,t), at location x E E and time t > 0, is the 
probability that, at time t, x is covered (or “captured”) by some crystal, i.e. 

£(M) ; = P(% e © 4 ) = E[l e t(x)]. 



As a complement to the degree of crystallinity (or probability of capture) 
£(#,£), one can define the survival function or porosity of the point x at time t, 
which is the probability p x (t) that the point x is not yet crystallized at time t, i.e., 
p x {t) = P(x 0 Q l ) — l — £(x,t). Correspondingly, we may introduce the hazard 
function h(x,t) as the rate of capture, i.e., the rate of the capture probability, 

7 / X I- P ( X G (6 t + At 

h(x, t) = lim 

v ’ ; At — >-0 At 



We can see [ 1 ] that 

Q- t £ 0 M) = (1 - £(x,t))h(x,t). 

Under the assumption of growth along the normal to the external surface due 
to a given field of growth rates G(x,t), and assumptions of sufficient regularity of 
the field of birth rates a(x,t), it can be shown [5] that for almost any x G E and 
any t G M+ 

h(x, t) = G(x, t)S ex (x , t) ( 1 ) 

where S ex is the mean value of the extended surface of the crystals, defined as 
follows in ( X d denotes the usual Lebesgue measure). 



El ex( x 5 ^) — lim E 

r j 0 



^ A <, - 1 (3e$nfl r (a;)) 
h, ^(Br(x)) 



Clearly S ex (x,t) represents the expected value of the spatial density of the 
sum of the surfaces of the individual crystals which are born and develop indepen- 
dently of each other. From the above considerations, we have in this case 
d 

— ^(x, t) = G(x, t)S ex (x, t)( 1 - £(x, t)). 
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This formula may be seen as an extension of the Kolmogorov- Avrami-Evans for- 
mula to the spatially heterogeneous case. 

In order to obtain an explicit evolution equation for £(#,£), we need to couple 
the previous one with an evolution equation for S ex (x,t). This has been done in 
[4] as follows. 

In R d we introduce first the (stochastic) analogue u(x,t) of S ex (x,t) such 
that for any B G we have 

f u(x,t)\ d (dx ) = A d ~ 1 (dO t j HB). 

dB Tj <t 



In R 2 , with given fields a(x,y) and G(x,y), we 

for u 



have the following system 



— - di v(Gv) + Sj 

Tj=t 

| = V( G „) 

u = 0 
v = 0 
u + v • n = 0 



in E x R + 

in E x M + 

in E x {0} 
in E x {0} 
on dE x R_|_, 



(2) 



where the only source of stochasticity is due to the nucleation Y^r =t ls 

generalized function that describes the stochastic event of birth of the j-th crystal 
in R 2 (system (2) has to be understood in a weak sense). 

Since the system is linear and 



Sex(x,t) := E[u(x,t)\, 



we can obtain the deterministic equations 



dt 

dE[v\ 
< dt 



div(G£[t;]) + £?[53 S 2 j 

Tj =t 

-- V(GS ex ), 



( 3 ) 



subject to the corresponding boundary and initial conditions. In R 2 (see [4]) 



E 



X s i 



2n G(x,t) f a(x,t)ds. 

Jo 



Once available the hazard function and the growth rate, we may obtain fur- 
ther information about geometric measures regarding the crystallization process, 
such as the volume densities of interfaces of crystals at all Haussdorff dimensions 
(see [14] and [6]). 




Moving Boundaries for Crystallization 



79 



3. Coupling with temperature 

In an experimental situation where spatial heterogeneities are caused only by the 
heat transfer in the material, we may assume that growth and nucleation rates are 
determined as material functions of the temperature [9]. 

G(x, t) — G(T(x , t)), a(x, t) = a(T(x , t)). (4) 

Viceversa, the increase of the crystalline phase influences the heat transfer 
process in the material because of the release of latent heat, i.e., the temperature 
T is determined by the heat transfer model 

r\ r\ 

— (pcT) = div (kVT) + — (/i/?7 0t ), in R + x E, (5) 

3T 

— =P(T-T out ), on R+xdE, (6) 

where ^ denotes the (outer) normal derivative on dE. 

Here J^(/©t) denotes the (distributional) time derivative of the indicator 
function 1st of the crystalline phase at time £, noting that Equation (5) has to be 
understood in the sense of distributional derivatives. The parameter p denotes the 
density, c the heat capacity, k the heat conductivity, (3 the heat transfer coefficient 
and h the latent heat due to the phase change. 

The parameters in the heat equation in general may depend upon the phase. 

Thus the heat transfer model is now a random partial differential equation, 
since all parameters and the latent heat depend upon the random variable 7©t . A 
direct consequence is the stochasticity of the temperature field, whose evolution 
influences (via (4)) the crystallization process itself (see also [3]). 

Under these circumstances, System (2) is doubly stochastic, and as much 
nonlinear with respect to the operator E (expected value). In particular, we notice 
that 

E[div(Gv)] ^ di v(E[G]E[v}) 

since now G(T(x,t )) is a stochastic quantity itself. 

To overcome this complication, we may take into account the fact that under 
typical industrial conditions (many and small crystals), we deal with a double 
scale process, so that one may approximate the stochastic model presented above 
in two ways. If we wish to obtain geometric information about the crystallization 
process such as the evolution equations for the interface densities, we approximate 
the stochastic temperature field with an averaged deterministic one and so go back 
to the approach described in Section 2 (Hybrid model). If we are just interested 
in the evolution of the crystallinity (volume density of the crystal phase), then we 
may approximate the stochastic system with a fully deterministic one, as done in 
[7] by using an interacting particle system coupled with the temperature field, and 
proceed via suitable “laws of large numbers” . 
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4. The deterministic reaction-diffusion model 



In the deterministic model the geometry is lost. We can make use of this fact as- 
suming that a symmetry exists in the ^-direction, so that the model can be studied 
in the one-dimensional case. For completeness, we reproduce here the model ob- 
tained in [1]. It consists of the following equations for the crystallinity y(x,t ) and 
the temperature T(x, £), for x G [0, L] and t > 0, 



dy 

dt 


L ' J ' 

1 3(y)b G (T)+v 0 K(y)b N (T ) 


(7) 


dT 


d 2 T , , 




~dt 


v ^2 + a c (3(y)b G (T), 


(8) 



and of the initial and boundary conditions 



y{x,0) 


= y\m(x), xe[o,L], 


(9) 


T(x, 0) 


= T ini (x), xe[0,L], 


(10) 


T(0,t) 


= ^app Vi > 0, 


(11) 


dT(L,t) 

dx 


= 0 Vi > 0, 


(12) 



where K,(y) and (3{y) reproduce the features of crystallization phenomena, respec- 
tively, the start of the nucleation, and the aggregation and saturation of nuclei 
(see [7]): 



/%> = { 9(1 o t,) 



in [0, 1] 
outside 



«(y) = 



(1 - y) 2 in [0, 1] 
0 outside 



(13) 



The constants cr, Vo and ac are the diffusion coefficient, the initial mass and the 
non-isothermal factor, respectively. The functions b^(T) and bc(T) are, respec- 
tively, the nucleation and growth rates for a temperature T; they are typically 
given by exponential functions with a negative exponent: 



b N (T) = N 0 e- MT - Tr ° f) , b G (T) = G o e~ 0G{T ~ T ^ ) , (14) 



where N and G are the nucleation and growth factors, (3n and (3g the nucleation 
and growth exponents, and T re f is a temperature of reference (usually T re f = 0). 
They have been obtained via statistical regression analysis in [7] and Ref. 6 therein. 

It has been shown in [10] that a threshold temperature of freezing Tf must be 
introduced, above which no nucleation and no growth can take place: 
b^{T > Tf) = bc(T > Tf) = 0. The reason is that without this threshold, 
the full crystallization of the sample would be only a matter of time for every ap- 
plied temperature, which may be not realistic. The introduction of Tf can be done 
simply by truncating the exponentials in (14), so that the rate functions will be 
discontinuous functions of T, or by modifying (14) to make the exponentials decay 
continuously to (T/,0). To our understanding, there is no experimental basis to 
decide which option is the most realistic one. The second option can be preferred 
because of the continuity, but Nature is sometimes not continuous, and the dis- 
continuous option is in fact the general case (the other one being a particular case 
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of discontinuity of size 0), and it is mathematically more interesting, the casuistry 
being (not surprisingly) more rich. 

For simplicity, we can assume that both rate functions can be reduced to 
a general rate function 6(T) such that bw(T) = N$9(T) and bc{T) = Go0(T), 
where 



0( - T) 1 - a 0 e~ T f/ w { 



e _/3T - aoe-^f/ 10 

0 



if T < T f 
if T > T f 



(15) 



ao being a measure of the discontinuity of 6 at T = Tf. 

From the mathematical point of view, this model defines a Free Boundary 
Problem (FBP) for the temperature field. Let us illustrate this by means of a 
simple model of heat diffusion. Let L = 7r/2 and u(x,t) be the solution of 



'U>t 'U'xx — 0} 


(16) 


u(x, 0) = sinx, 


(17) 


u(0,t) = 0, 


(18) 


^'x(-^ / 5^') 0. 


(19) 



In this problem it is not difficult to establish the existence, the uniqueness and 
the monotonicity of the solution. In fact, u is given by u(x,t) = e -t sinx. Then, 
for a given value a £ (0, 1) and for each t £ [0, In ^], there exists a unique abscissa 
Xb(t) £ [0,L] such that u(xb(t),t) = a: 

Xb(t) = arcsin (ae*). (20) 

At a given instant of time, Xb(t) determines a boundary in the interval [0, L], in 
the sense that it separates [0, L] into two intervals with some qualitative difference, 
namely, the sign of u — a. Eq. (20) gives the time evolution of this boundary value. 
In this case, Xb evolves with an always positive and increasing velocity, and it 
reaches the end of the interval at t — In see Fig. 1. 




Figure 1 . Left: Solution u(x,t) of the heat equation at four in- 
stants of time. At the height a = 0.3 the boundary point Xb(t) 
moves with an increasing velocity and arrives tox=|att^l.2. 
Right: time evolution of Xb(t). 
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In a similar way, a boundary value Xb(t) is expected to be found in the poly- 
mer crystallization temperature field. Our ideal resolution of the polymer problem 
is to provide the same description as in the simple heat diffusion problem. Never- 
theless, for such a problem it is more complicated to obtain analytical expressions 
or even estimates of the velocity and of the arrival time. 

A first attempt in this direction has been recently presented in [10], where 
Eqs. (7) and (8) have been solved numerically with the conditions (9)— (12) and 
the parameter values shown in Table 1. 

Table 1. Typical values and units of the parameters. 



Symbol 


Physical meaning 


Typical value 


Units 


Tapp 


applied temperature 


0-100 


°C 


a 


diffusion coefficient 


0.01 




Tni 


initial temperature 


100 


°C 


2/ini 


initial crystallinity 


0 


- 


Tf 


nucleation temperature 


40 


°c 


T a 


growth temperature 


40 


°c 


Tre f 


reference temperature 


0 


°c 


No 


nucleation factor 


1000 


s -1 


Go 


growth factor 


0.5 


s _1 


Pn 


nucleation exponent 


0.1 


rc)- 1 


Pg 


growth exponent 


0.1 


rc)- 1 


Vo 


initial mass 


0.01 


- 


&G 


non-isothermal factor 


2500 


°c 


L 


length of the sample 


10- 3 


m 



The result is that the temperature field exhibits an advancing boundary point 
at height Tj, approximately. This moving boundary is accompanied by an advanc- 
ing front of crystallization in the mass distribution. Assuming that T(x,t) is a 
monotone increasing function of x at every time t, as in the simple heat diffusion 
problem, Xb(t) can be defined as the unique value such that 

T(xb(t ), t ) = Tf. 

We can also define two abscissae to describe the mass front, 

x a (t) = max{x/ y(x,t) = 1} (21) 

= min{x/ y(x,t) = 0 }, ( 22 ) 

in order to characterize the solution in terms of the time evolution of these three 
parameters. 

It appears now clearly that a Free Boundary exists in the temperature field, 
in the sense that Xb(t) divides the interval [0, L\ into two parts, depending on the 
degree of crystallization: at time t , the part x < Xb(t) is almost fully crystallized 
(y = 1), whereas the part x > Xb(t) is almost empty of crystals (y = 0). 

The following Free Boundary Problem can be formulated: 

The free boundary Xb(t) is a time- dependent parameter which divides [0, L\ 
into two intervals I\ = [0, Xb] and I 2 = [#*>, L\. Inside each interval, the temperature 
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Figure 2. Left: Time evolution of the three coordinates which 
characterize the solution: the moving boundary Xb{t) of the tem- 
perature field and the mass band (x a (t),xp(t)). The velocity is 
slightly decreasing. Right: Crystallization time for two strategies 
of cooling in a slightly different case (a = 0.002, Tf = 70, N 0 = 20 
and Go = 5): cooling only at one side of the sample (solid line), 
and at both sides (dot-dashed line). 

T(x,t ) solves the heat equation T t = crT xx with the initial condition (10) and the 
boundary conditions (11) and (12). The degree of crystallinity is such that y = 1 
in Ii and y = 0 in I 2 , and Xb(t) describes the crystallization process depicted in 
Fig. 2. 

Still, we need an equation for the evolution of Xb(t) and the boundary condi- 
tions for T(x,t) at x£(t) and x^(t). The difficulty in completing the formulation 
lies in that the regions where y — 1 and y — 0 are in fact separated by a narrow 
band, and its width may not allow us to reduce the mass front to a point. It has 
been shown numerically that the mass front may be considered more a band than 
a layer [10], and an asymptotic argument for sufficiently large samples has recently 
established that what happens in the interior of the band can not be neglected [11]. 

Fig. 2 (Left) shows the monitoring of the crystallization process by means of 
the three abscissae defined above. It is worth noticing that both the band of crys- 
tallization and the boundary in the temperature field advance with a decreasing 
velocity, which implies that two different space segments of the same length have 
different times of crystallization. We have also obtained numerically the arrival 
time defined in the simple heat diffusion problem; see Fig. 2(Right). The variation 
of the crystallization time as a function of the applied temperature is, as expected, 
monotonically increasing, but not linear. Thus, a better strategy of cooling can 
be suggested: namely, to cool the sample at both sides with a smaller (i.e. less 
expensive) temperature. The difference with the first strategy is as greater as the 
applied temperature is smaller, that is, as smaller is the time (i.e. the money!) 
spent in the process. 

Analytical results are more difficult to obtain. For instance, we have provided 
a numerical temperature field that can be used in the hybrid stochastic model. We 
have used this field in a stochastic simulation of model (2), and we have displayed 
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the result in Fig. 3, where we represent the degree of crystallization at different 
times. 






(ime=flO s 







Figure 3. Evolution of the crystalline phase. 



At the beginning of the process crystals develop randomly where the tem- 
perature is lower than T/, but after a short time interval (less then 10 sec.), the 
crystallization front becomes deterministic and follows the evolution of the tem- 
perature field. In Figure 4, we can see the crystalline front compared with the 
moving boundary Xb(t) of the temperature field. 




Figure 4. Crystalline front (solid line) and moving boundary 
Xb(t) of the temperature field (dotted line). 








Moving Boundaries for Crystallization 



85 



Acknowledgments 

Financial support is acknowledged to the EU under the TMR-Network Differential 
Equations in Industry and Commerce , grant N.FMRXCT97-0117, to the Italian 
MURST/cofin, program “Stochastic Processes with Spatial Structure”, and to the 
Italian CNR contract n. 98. 03635. ST74. 



References 

[1] V. Capasso, Ed., Mathematical Modelling fo Polymer Processing. Polymerization, 
Crystallization, Manufacturing. Mathematics in Industry, ECMI Subseries, Vol. 2, 
Springer-Verlag, Heidelberg, 2003. 

[2] M. Avrami, Kinetics of phase change. Part /, J. Chem. Phys., 7 (1939), 1103-112. 

[3] M. Burger, Crystal Growth and Impingement in Polymer Melts , this volume. 

[4] M. Burger, V. Capasso, C. Salani, Modeling multidimensional crystallization of poly- 
mers in interaction with heat transfer, Nonlinear Analysis: Real World Applications, 
3 (2002), 139-160. 

[5] V. Capasso, A. Micheletti, The hazard function of an inhomogeneous birth-and- 
growth process, submitted. 

[6] V. Capasso, A. Micheletti, M. Burger, Densities of n-facets of incomplete Johnson- 
Mehl tessellation generated by inhomogeneous birth- and- growth processes, submitted. 

[7] V. Capasso, D. Morale and C. Salani, Polymer Crystallization Processes via Many 
Particle Systems , in [1], 243-259. 

[8] V. Capasso, C. Salani, Stochastic-birth- and- growth processes modeling crystallization 
of polymers with spatially heterogeneous parameters, Nonlinear Analysis: Real World 
Applications, 1 (2000), 485-498. 

[9] G. Eder, H. Janeschitz-Kriegl, Structure development during processing : crystalliza- 
tion, in Materials Science and Technology, Vol. 18 ( H. Meijer, Ed.), Verlag-Chemie, 
Weinheim, 1997. 

[10] R. Escobedo, V. Capasso, Moving bands and moving boundaries with decreasing speed 
in polymer crystallization , submitted. 

[11] R. Escobedo, V. Capasso, Reduction of a Mathematical Model for Polymer Crystal- 
lization , in Progress in Industrial Mathematics at ECMI2002, Springer (to appear). 

[12] V.R. Evans, The laws of expanding circles and spheres in relation to the lateral 
growth rate of surface films and the grain-size of metals , Trans. Faraday Soc., 41 
(1945), 365-374. 

[13] A.N. Kolmogorov, On the statistical theory of the crystallization of metals , Bull. 
Acad. Sci. USSR, Math. Ser. 1 (1937), 355-359. 

[14] A. Micheletti, V. Capasso, The stochastic geometry of the crystallization process of 
polymers , this volume. 




86 



V. Capasso, R. Escobedo, and C. Salani 



MIRIAM - Milan Research Centre 

for Industrial and Applied Mathematics 

and Dept. of Mathematics, University of Milan 

via Saldini 50, 20133 Milan, Italy 

E-mail address : Vincenzo . CapassoOmat . unimi . it , 

escobedo@mat . unimi . it , salani@mat . unimi . it 




International Series of Numerical Mathematics, Vol. 147, 87-101 
© 2003 Birkhauser Verlag Basel/Switzerland 



Upscaling of Well Singularities 

in the Flow Transport 

through Heterogeneous Porous Media 

Zhiming Chen and Xingye Yue 



Abstract. We consider the steady flow transport through highly heteroge- 
neous porous media driven by extraction wells. We develop a new upscaling 
technique lumping the small-scale details of the medium property into a few 
representative macroscopic parameters on a coarse scale, which preserve the 
large-scale behavior of the medium and are more appropriate for simulations. 
The method is based on the recently introduced over-sampling multiscale fi- 
nite element method and on the introduction of new base functions that locally 
resolve the well singularities. Numerical experiments are carried out for flow 
transport in both periodic and randomly generated log-normal permeabilities 
to demonstrate the efficiency and accuracy of the proposed method. 



1. Introduction 

The central difficulty in the modelling of subsurface flow and transport is to ac- 
count for the spatial variability in the parameters used to characterize the relevant 
physical properties of the natural porous media. In realistic situations, the pre- 
cise spatial distribution of the parameters which are required to characterize the 
problem is never available due to the lack of enough data. Thus, sophisticated ge- 
ological and geostatistical modelling tools are used in practice to generate highly 
detailed medium parameters, which are based on some site-specific measurements 
and on experience from other sites. There exists a vast literature on the upscaling 
or homogenization techniques that lump the small-scale details of the medium into 
a few representative macroscopic parameters on a coarse scale, which preserve the 
large-scale behavior of the medium and are more appropriate for reservoir simula- 
tions. We refer to the book of Christ akos [3] for more information on the random 
field modelling of the natural porous medium parameters, and to the recent review 
paper [15] on the existent upscaling techniques in the engineering literature. 
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The study of steady flow through highly heterogeneous porous media driven 
by extraction wells is of great importance in hydrology and petroleum reservoir 
engineering. It is observed in the engineering literature (cf. e.g. [5], [12], and the 
references therein) that in the near-well region, many of the existing upscaling 
methods do not provide satisfactory results. The explanation may be that the 
standard upscaling methods generally assume that the pressure field is slowly 
varying, which is clearly not true in the proximity of the flowing wells [5] . This fact 
may also be accounted for mathematically on behalf of the homogenization theory 
for second order elliptic equations with periodic coefficients. In the homogenization 
theory, multiscale convergence is ensured under the assumption that the source 
should be at least in H ~ 1 , so that the solution is bounded in the Sobolev space if 1 . 
As we will see below, however, well singularities can be modelled as Dirac sources 
and thus, in the near-well region the solution behaves like a Green function which 
is not uniformly bounded in if 1 . 

The recently introduced multiscale finite element method [8, 9] for solving 
elliptic equations with oscillating coefficients provides an effective way to capture 
the large scale structures of the solutions on a coarse mesh without resolving all 
the fine scale structures. The central idea of the method is to incorporate the 
local small scale information of the leading order differential operator into the 
finite element bases. It is through these multiscale bases and the finite element 
formulation that the effect of small scales on the large scales is correctly captured. 
In section 2, we will describe one engineering upscaling technique and discuss its 
relation with the multiscale finite element method. 

Let C R 2 be a bounded domain with Lipschitz boundary I\ Let us denote 
by B(xo,r) the disk centered at xq with radius r > 0. Let Bj — B(xj,5), 1 < 
j < A, be mutually disjoint subdomains inside ft that are occupied by the wells. 
Denote by ft s := We consider the following single phase pressure 

equation which is formed by combining Darcy’s law with the conservation of mass 

— di v (K(x)Vus) = 0 in f Is, (1) 



where u$ is pressure and AT is the permeability, which is typically highly variable 
in space. We will impose a homogeneous Dirichlet boundary condition us |r = 0 
on the exterior boundary. The other kinds of boundary conditions can be dealt 
with similarly without any essential difficulties. On the well boundary Tj = dBj , 
two quantities are of particular importance in practical applications: the well bore 

f du 

pressure udr and the well flow rate / K-——ds , where v is the unit outer nor- 

J r, dv 

mal to dft§. The boundary condition to be imposed on Fj is either the Dirichlet 
boundary condition which fixes the well bore pressure ay (assumed to be constant) 

'u>s\r j =OLj, ( 2 ) 

or the following mixed boundary condition which fixes the well flow rate q ■ 



us |r — Cj = const , 



ry* @ U 5 J 

K—d s 

ov 
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The constant Cj in (3) is unknown. 

The purpose of this paper is to develop a complete multiscale coarse grid 
algorithm, based on the multiscale finite element method, for solving steady flow 
problem involving well singularities in heterogeneous porous media. The additional 
well singularities of the problem are resolved locally by adding finite element base 
functions. The final coarse grid model is based on a variational formulation which 
is different from the heuristical techniques in [5, 12]. This rigorous variational 
coarse grid model allows us to establish a complete multiscale error analysis for 
locally periodic medium properties based on the homogenization theory. We em- 
phasize that, as pointed out in [6], the spatial periodicity assumption does not a 
priori restrict the applicability of the results only to media exhibiting such a strict 
repetitive spatial ordering in the properties of interest. Indeed, the numerical ex- 
periments carried out for random log-normal relative permeabilities demonstrate 
clearly the applicability of our method beyond the periodic structures. 

Since the size of the wells S is negligible in practical situations, the first 
approximation we can make is to replace (l)-(3) by the following problem 

N 

-div (K(x)S7u) = in ^ (4) 

j=i 

with the boundary condition u\r = 0. Here is the Dirac measure at Xj, 1 < j < 

N. Denote by Kj = K(xj ), (f)j = — ^ In \x — Xj \ and U = u — Y2f=i Note 

that the flow rates in (4), for i G Id , are unknown. They are determined through 
the following additional conditions which are obtained by requiring i^|r i « oli as 
the approximation of the boundary conditions (2): 

+ ^2q j <t) j (x i ) + U(xi) = a it i e I D . (5) 

1 j^i 

We derive the following error estimate between the solution u§ of the original 
problem (l)-(3) and the solution u of (4)-(5) 

N 

max \u — us\ < CS | In <5| |q-|. 

xen 5 r-f 

.7=1 

To our knowledge, this error estimate is the first in the literature, although the 
convergence of the solution u§ to u as S — ► 0 is well-known in certain sense (cf. e.g. 
[10]). Instead of presenting the full proof of the estimate, which can be found in [2], 
we will present in section 3 the key ideas of the proof in the simplified situation of 
a single well and constant permeability. In this case, the proof is more transparent 
and the estimate is sharper (without the logarithmic factors). 

To introduce our multiscale algorithm, we first introduce an equivalent varia- 
tional formulation for (4), which is suitable for our multiscale approximation. Let 
fij, 1 < j < iV, be mutually disjoint subdomains inside D such that Bj CC flj. 
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Let Gj be the Green function associated with the domain ftj 
— div (K(x)VGj) = in fij, Gj | Sj = 0, 
where Xj = dSlj. Now, for any v E Cq°(Q) we have 



){xj) = / — div (K{x)VGj)vdx 

jQj 

= [ K(x)VGjVvdx- [ 



K^p-vds. 

ov 



On the other hand, from (4) we know that 



N r 

= / K(x)VuVvdx. 

3 = 1 



( 6 ) 



Let Gj = 0 for x E Q\Qj and ( = u — then we know that £ E Hq( Q) 



satisfies the following variational form 

f K{x)VC,Vvdx = - / K lfr vdx W G C o°( Q l 

J Q, j__ i J S j 



( 7 ) 



Denote by Wj = Gj + - In \x — Xj\ in Qj, then the complementary condition 

27rKj 



(5) to determine for i E Id becomes 



--^^ InS + qiWiix^+Cixi) =ati, i € I D - (8) 

We note that for £ the singularities of the original solution u are removed and we 
will use the over-sampling multiscale finite element method to discretize it on a 
coarse grid. 

In section 5, we will discuss the implementation of our algorithm and show 
some numerical results for randomly generated coefficients to demonstrate the 
accuracy of the method. 



2. Upscaling of the permeability 

The purpose of this section is to show that one of the well-known engineering 
upscaling techniques (see e.g. [11]) is equivalent to the multiscale finite element 
method proposed in [7, 8]. We remark that the multiscale finite element method 
is shown to be convergent under the condition that the permeability is locally 
periodic K e {x) = K(x,x/e), where AT(x, •) is periodic with respect to the second 
variable. As a consequence, the convergence of the engineering approach described 
in this section is guaranteed. 

Let Mh be a finite element mesh of fl with the mesh size H much larger 
than the £, the characteristic length representing the small scale variability of the 
media. Usually, £ is equal to the correlation length in the statistical random field 
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modelling of the media. Let Wh be the standard conforming linear finite element 
space over Mh and W # = W# niTo(ft). In the engineering literature, the problem 



-div ( K £ (x)Vu £ ) = f in ft, 



0 on r 



( 9 ) 



is approximated by the homogenized or upscaled problem: find u* H £ such 
that 



[ K*(x)Vu* H Vv H dx= [ fv H dx Vv H G W&, 
Jn Jn 



( 10 ) 

/ft Jn 

K * being piecewise constant on the coarse mesh Mh- The so-called effective per- 
meability matrix K* on each T £ Mh is defined as follows. For any G £ R 2 , let 
6 £ be the solution of the problem 

-div ( K £ V9 £ ) = 0 in T, 6 £ \ dT = G • x. 

A simple integration by parts implies that 



V6t 






G. 



( 11 ) 



On the other hand, since Q = K £ V6 £ 
a matrix K * such that 

Q = K* -G 



I^Tj f T K £ V6 £ dx is linear in G , there exists 



-k £ vo £ 



-K*V0 e . (12) 

The multiscale finite element method introduced in [7, 8] is based on mul- 
tiscale finite element base functions. For any T £ Mh , let i = 1,2,3, be the 
linear nodal bases. Define , i = 1, 2, 3, as the solution of the local problem 

-div (K e V4>J) = 0 in T, <j>J\ dT = V’i- 

Denote by V(T) — span {(/)[, i = 1,2,3} and by Vh the finite element space 

VH = {v H eH 1 (n):vH\ T eV(T)}. 

Then the multiscale finite element approximation to (9) is: find uh £ V# = Vh H 
Hi (ft) such that 

I K £ Vu H Vv H dx = j fvndx Vvh £ V#. (13) 

Jn 7ft 

We have the following proposition concerning the relationship of these two 
upscaling methods. 

Proposition 2.1. Let A * and A be the stiffness matrices corresponding to the dis- 
crete problems (10) and (13), respectively. Then we have A * = A. 

Proof. It is clear that we only need to prove that the element stiffness matrices 
coincide, that is, for any T £ Mr and i,j — 1, 2, 3, 



X 



K*V'if i V'ifjdx 



-Jr 



K £ V<t>lVtfdx. 




92 



Z. Chen and X. Yue 



To do that, we note that by (11) we have V</>f = Since V^, V?/y are constant 
over T, we get by (12) that 



X 






\T\K*V(t>JVipj = J K'VtfWjdx, 



which, after integration by parts and using the definition of the multiscale base 
functions, yields 



x 



K*Vil>iVi)>jdx = [ (K e V<f>J ■ vtyds 

JdT 



This completes the proof. 




3. The mathematical modelling 

Let ft be a convex polygon in R 2 with boundary T. Let B(xo,r) = {x E R 2 : 
\x — xo\ < r}. Let Bj = B(xj,5), 1 < j < N, be mutually disjoint subdomains 
inside ft which are occupied by the wells. Set Tj = dBj and £1$ = ft\(U 
We consider the following problem 

— div (K(x)Vus) = 0 in ft<$, (14) 

uskj = const , J K^-ds = q :/ for j G I M , (15) 

U8\rj=ctj for jelo, M< 5 |r= 0 . (16) 

Here /#, 1m are index sets such that Id U 1m — {1, 2, • • • , TV}, while ay for j E Id 
and q- for i E Im are known constants. We imposed the following conditions on 
the coefficient K(x): 

(HI) K E C 0,1 (fi), and there exist constants A, A > 0 such that A < K(x) < 
A -1 and | K(x) — K(y)\ < A\x — y\ for any x,y E £1. As we are interested in 
highly variable coefficients, we can assume A > 1 without loss of generality. 

As already mentioned in §1, we will consider the following approximation of 
the problem (14)-(16): find u and the constants (q*}z<E/ D , such that 



N 

—div (K(x)Vu) = jS^ in ft, 
j= 1 



27 xKi 



ln <5+X + U(xi) 



= 



u|r = 0. 



for i e Id, 



(17) 

(18) 
(19) 
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Here 5%. is the Dirac measure at x ? -, Kj = K(xj ), 6j(x ) = - ^ In lx - x ? |, and 

j j j \ j, j\ , 27 vKj 

U = u — Ylf=i Qj^j- The relation (18) can be viewed as an approximation of the 
boundary condition u\ri ~ on for i G Id- 

The following error estimate is derived in [2]. 



Theorem 3.1. Let the assumption (HI) be satisfied. Let u§ and u be the solutions 
of (14)-(16) and (IT) —(19), respectively. Then there exists a constant 70 € (0,1) 
such that for 5 A < 70 the problem (17) -(19) has a unique solution. Moreover, 
there exists a constant C independent of 5, and A such that 



and, for every i E Id, 



N 

max | u — u$\ < CA(1 + In A) ly/2 ^^ lOjl^l l n ^|» 



3 = 1 



ki - qf I < CA(1 + In A) 1/2 ( ^ Iq^- 1) In 5| , 



where 



3 = 1 



q i = j Ke ~f^ ds f° ri ^ I D- 



The proof of this theorem is rather complicated. It depends on a sharp esti- 
mate between the well flow rate and the well bore pressure and an explicit depen- 
dence on p of the constant in the classical W 1,p interior estimate for second order 
elliptic equations. Instead of presenting the full proof of the estimate which can be 
found in [2], we will discuss in the sequel the simplified situation of a single well 
flow in media with constant permeability. 

The starting point of the proof is the following basic lemma. 



Lemma 3.2. Let = Fl\B(x o,d), F$ = dB(xo,5) and T = dQ. Consider the 
problem: 



—Av = 0 in v = 1 on Fs, v = 0 on f. 



Then 



2n fdv 2n 

lnDo/5 ~ Jy 5 dv S ~ Indo/S ’ 



where do = min xG r |x — Xq\ and Dq = max xG r \x — xq 



Proof. Let r — \x — xo|. Consider the following barrier functions 

+ = Hr/S) Hr/s) 

6 HDo/S)’ 6 Hdo/sy 

Let wf = vf — v. Then it is clear that —A wf = 0 in Cl's, wf = 0 on Fj, and 
> 0, wf < 0 on T. Thus, by the maximum principle we get w ^ > 0, wf < 0 
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in This yields v s < v < in By the definition of vf, we obtain 
In r — In 5 1 < v ~ 1 < In r — In 5 1 



ln(2?o/«) 



in Q$. 



□ 



r — 5 \n(do/5) r — 8 r — S 

By letting r — > 5 + , we get the desired estimate. 

A direct consequence of the basic lemma is the following corollary. 

Corollary 3.3. Consider the problem: 

-AC = 0 in C|r = 0? C|r 5 = Ps, J > 0- 

Then we have 

Now we consider the problem with a single well and constant permeability 

f du$ 

-Au$ = 0 in ftj, tii| r = 0, us\r s =ot 5 , J — 
and its limit problem 

—A u — q^ 0 in iz|r = 0. 

Let u — — ^ In \x — xo| + w and v$ = u$ — u. Then we have 
—Avs = 0 in tts, vs|r=0 

V6\r s =as + -^ln5-w\r s , J ^ds = 0 . 

The problem is now reduced to estimating || v§ ||o,oo,^ 5 - In order to do this, we first 
split vs into two parts: vs = v 1 +^ 2 , where v\ and V 2 satisfy 

— Avi = 0 in SI#, ^i|r = 0, Vi = w(xq) — w on f^, 



-ds 



( 20 ) 

( 21 ) 



and 



-A vo 



■■ 0 in tls, ^2 |r = 0, 

= <w + £ hi - »(*„) =: <4 £ ^ds = -j u % 7 



ds. 



fT s V" Jr 5 

Since w is smooth around xo, it is easy to see that \w — w(xo)\ < CoS on 1^, which, 
in turn, implies that \vi\ < CoS in Qs- 

Let 6 = CoSv ± fi, then —A 0 = 0 in 6 = 0 on T and 6 > 0 on Ts. Thus 
6 > 0 in Qs and < 0 on T. Therefore 



[ TT-to 






< c 0 s 


1 


Jr s dv 




Jr dv 




Jr 



C 0 d 



l 



dv 

—ds 

T, dv 



< 



2nC 0 5 
ln{do/6) ' 
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Now by the third equality in the definition of V 2 , we know that 

2ttCq6 



[ 9v2 j 

L dv 

which implies by Corollary 3.3 that 

Kl< 



< 



1 2irC 0 8 , (D 0 



In (do/ 8)’ 



2i r \n(do/5) 

A final application of the maximum principle gives |t7 2 | < CS. This proves the 
following theorem. 



Theorem 3.4. Let u$ and u be the solutions of (20) and (21), respectively. Then 
there exists a constant C independent of 6 such that the following estimate holds 



max | us — u\ < CS. 
xefis 



4. The multiscale method 

In this section, we are going to introduce our multiscale method to solve the 
problem (17)— (19), which is a good approximation of the original problem (14)- 
(16) for small 6 according to the analysis in last section. We make the following 
local periodicity assumption for the coefficient: 

(H2) K(x) = K e (x) = K(x,x/e) for some small parameter e > 0, and we 
assume K G C 1 (fi; (7p(R 2 )), where C*( R 2 ) stands for the collection of all 
the C 1 (R 2 ), periodic functions with respect to the unit square Y. 

Our method is based on the over-sampling multiscale finite element method 
in [7] , which reduces the so-called resonance error in the previous multiscale finite 
element method in [8, 9]. We first briefly recall the basic facts of the over-sampling 
multiscale finite element method. Let Mh be a regular and quasi-uniform trian- 
gulation of Q, and Vh the standard conforming linear finite element space over 
Mh • For any T G Mh with nodes {xj }f =1 , let Hr denote the size of T, P\(T) 
the set of linear polynomials defined in T, and {cpj}^ =1 the basis of P\(T) sati- 
sfying = Sij,i,j = 1,2,3. For any T G Mh-> we denote by S = S(T) a 

macro-element which contains T and satisfies the following condition: 

(H3) Hs < C\Ht and dist ( dT,dS ) > SqHt for some positive constants C\, £o 
independent of H. The minimal angle of S(T) is bounded from below by 
some positive constant Qq independent of H. 

Let MS(S) be the multiscale finite element space spanned by = 1,2,3, 
'ipf G H 1 (S) being the solution of the problem 

-div {K e (x)ViPf) =0 in S, ^\ ds = V?- 

Here {ipf }f =1 is the nodal basis of Pi(S) such that ipf(xj) = 5{j , i,j — 1, 2, 3. The 
over-sampling multiscale finite element base functions over T is defined by 

^ = c[ 3 ^\ t in T, 
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with the constants cT chosen in such a way that 

inT - 

The existence of the constants cL is ensured from the fact that {<Pj}^ =1 also forms 
the basis of Pi(T). In practical applications, the base functions ipf of MS(S ) 
will be further approximated by solving (22) on a mesh of 5, with a mesh size 
resolving £ and using linear finite element methods [7]. These base functions can 
be preliminarily computed to generate the coarse grid operator, which turns out 
to be only a small overhead of the overall computations. This is also a common 
practice in many upscaling methods [15]. 

Let OMS(T) = span {^}? =1 and I It : OMS(T) — > P\(T) the projection 

n T ^ = avl if r/> = atif g OMS{T). 

Let Xh be the finite element space 

X H = Wh : G OMS(T), VT e M H } 

and define n# : Xh —> Ht£M h Pi(T) through the relation TLhiPh\t = II t^Ph for 
any T E Mh, ip h G Xh- The over-sampling multiscale finite element space is then 
defined as 



Xh = {(pH G Xh : II h^Ph G Vh C H l (Q,)}. 

In general, Xh *t- H 1 ^) and the requirement II h^Ph G Vh serves to impose some 
continuity of the functions ipn G Xh across the interelement boundaries. 

For any j = 1,2,*** , N, let Qj be a convex polygon inside which consists 

of a union of elements in Mh such that Bj CC flj. We assume D Clj = 0 for 
any i ^ j. Let Gj be the Green function 

-div (K £ (x)XG £ ) = 6*. in Qj, G £ | s . = 0, 

where Ej = dQj. Let u £ and {q i}iei D be the solution of the problem (17)-(19). 
Then the argument of §1 yields that ( £ = u £ — Ylj=i QjG £ j G Hq(Q) and the 
constants {q i}iei D satisfy 

f N f dG £ 

J K E (x)V( £ V X dx = - X / Ke ~£h7 xds e 

-x%rl n <J + qi^i(Sj) + Ce(Xi) = OLi V* € I D - 

JjlT ^ 

Here and in the sequel G £ is extended to be zero outside fij, and Wj = G £ — (pj 1 

with (bj = — In \x — x A. 

r 3 2 nKj 1 J 1 
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Let Xjj = {ipH £ Xh • II n'lpH — 0 on T}. Then we introduce the following 
discrete problem: find (h £ X^ and {qf^ € / D suc ^ that 

p N r QG e 

K e {x)VCHVXHdx = Ke-^XHds V Xif e4, 

TeM H ^ T 3 = 1 V 

qH 

+ q^w i (x i ) +C H (xi) = ai Vi £ I D , 

where \ H = n hXh G V ( j = Vn n Hq(£1), and we set qj 7 = q^ for j £ Im to 
simplify the notation. 

We make the following assumption on the local domains fh,. 

(H4) Let Hj = dist (xj, Ej): there exist constants C 2 , C 3 > 0 such that < 
Hj < CsHi for any i, j — 1, 2, • • • , N. 

In practical applications, the most interesting quantities are the well bore pressure 
for the wells prescribing the flow rate and the flow rate for the wells prescribing 
the well bore pressure. The following theorem establishes the error estimate for 
these quantities. 



Theorem 4.1. Let the assumptions (H1)-(H4) be fulfilled. Then there exists a con- 
stant C independent of s, 6, H and Hj such that, for j E Im, 



max | 

xer. 



:|Ce-0 1 (x j )\<c{h + 

and, for j £ I D , 






In 



H 



N 



+ h + \Ijtj i lni? iEi^ 



H 



i—1 



|q,--qf|<C'{H+|^ 






In 



H 



£ 




N 



|lniL|^|q.|. 



i = 1 



The proof of this theorem can be found in [2], in which, some sharp bounds 
on the homogenization of the Green functions are derived. 



5. Numerical results 

We start this section by summarizing the algorithm proposed in this paper for sol- 
ving the problem (17)— (19) , which is a good approximation of the original problem 
(14)-(16) when the size of the wells is negligible. 

Algorithm 1. Given the well bore pressure oti for i E Id and the well flow rate 
q^ for j E Im, the following procedure finds the approximate well bore pressure 
af for j E Im, the approximate well flow rate q^ for i £ Id, and the approximate 
pressure u H = ( H + 1 Gj , where q f = q^ for j £ I M . 
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• For j — 1, • • • ,N, compute the discrete Green function Gj on each sub- 
domain flj, i.e. Gj G Vf’iflj) such that 

[ K{x)VG h j Vv h dx = v h {x j ) VvneVSiQj). (22) 

Ju j 

Here V^(Qj) = V h (Qj) n i#(fij), and V h (Qj) is the standard conforming 
linear finite element space over the mesh M.h(ftj) of ftj, with the mesh 
size h resolving the scale of the permeability field K(x). Compute the 
associate effective radius fj from Gj according to the following relation 

oirln rj= [ {K — Kj)V(f>\7G^dx — j KVGjVip h , (23) 

^ 3 J Clj 

where 0 = ~~ 2 -kK ' n \ x ~ x j\ arl ^ i’h € Vh(£lj), with nodal values defined 
by 

J, (~ \ - f ^( X k) if X k G 9Slj, 

\ 0 otherwise. 

Thus the approximate value of the Green function Gj on Tj is 
a^C^xj)-- = K(xj). 

• Use the over-sampling multiscale finite element method to find (h € 
and {<\f}iei D such that 



/ N p QQ h 

K{x)VC H V X Hdx = - X)qf / K—^XHds \/ X h € X° H , 



TeM 



qf af + (if(xi) = OLi for i E I D . 

• Compute the approximate well bore pressure for j E Im through the 
relation 



af = C H {xj) +qj a 1 }. 

The formula (23) defining the effective radius extends the well-known Peace- 
man method [13, 14] in the engineering literature, for five point finite difference 
discretization and constant permeability, to the general case and it is convergent 
for any finite element meshes and any heterogeneous porous media, as proved in 

[ 2 ]- 

Now we report one numerical example carried out for random log-normal 
permeability to show the accuracy and efficiency of our method. More extensive 
numerical experiments can be found in [2]. The computations are performed on 
the unit square ft = (0,1) x (0,1). We consider two wells Bj = B(xj,S) with 
x\ — (^,§), x 2 — (§,|). We impose the mixed boundary condition (3), which 
fixes the well flow rate q - for both wells. Thus we have = 0 and the interested 
quantities are the well bore pressure (WBP) on each well. In the computations, 
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Well No. 


Radius 


‘Exact’ WBP 


WBP 


Relative error 


1 


l.E-05 


-1.7278769 


-1.7013823 


0.1533E-01 


1 


l.E-04 


-1.3540093 


-1.3275152 


0.1957E-01 


1 


l.E-03 


-0.9801418 


-0.9536476 


0.2703E-01 


2 


l.E-05 


3.7894742 


3.8199386 


0.7773E-02 


2 


l.E-04 


2.8264341 


2.8558985 


0.1042E-01 


2 


l.E-03 


1.8623940 


1.8918584 


0.1582E-01 



Table 1. Results of the Algorithm 1 in the case of random log- 
normal field with a 2 — 1.69 and l x — l y — 0.01. 



we take q : = — 1 and q 2 = 1, which corresponds to the situation in which the well 
B\ is an extraction well and B 2 is an injection well. 

The domain Qj is taken to be the square centered at Xj with the length of 
the sides being 2 Hj so that Hj — dist (xj , d£lj ) . The discrete Green function G 1 - 
in (22) is computed on a uniform finite element mesh of size h = iJj/512. 

We generate the random field K(x) on a uniform 1024 x 1024 mesh by u- 
sing the moving ellipse average technique [4] with the variance of the logarithm 
of the permeability a — 1.3, and the correlation lengths l x = l y = 0.01 in the two 
space directions. Figure 1 shows a realization of the random permeability field. 
The “exact” well bore pressures computed by on the fine 1024 x 1024 mesh are 
on = -1.7278769, a 2 = 3.7894742. 

Table 1 shows the results computed by using the Algorithm 1 on the coarse 
64 x 64 mesh with the size of the local domains H\ — H 2 — B . The mesh size for 
solving the over-sampling base functions is 1/1024. It clearly shows that the WBP 
by our method on the coarse 64 x 64 mesh is a very good approximation of the 
“exact” WBP on the fine 1024 x 1024 mesh. 
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Figure 1 . The random log-normal permeability field K(x), The 
ratio of maximum to minimum is 5.0590E+04. 
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On Plasma Expansion in Vacuum 

Pierre Degond, Celine Parzani, and Marie-Helene Vignal 



Abstract. In this paper, we propose a model describing the expansion of a 
plasma in vacuum. Our starting point consists in a two-fluids Euler system 
coupled with the Poisson equation. Since numerical simulations of this model 
are very expensive, we investigate a quasi- neutral limit of it. We show that 
electron emission occurs at the plasma- vacuum interface. This emission is well 
modelled by a Child-Langmuir law. The difficulty consists in accounting for 
the motion of the plasma- vacuum interface. In this paper, we formally and 
numerically justify why electron emission produces a reaction pressure which 
slows the plasma expansion down. 



1. Introduction 

This paper is devoted to the study of a quasi-neutral plasma expansion in vacuum, 
and more particularly to the description of the motion of the plasma- vacuum in- 
terface. This configuration occurs in some diodes like the ones mentioned in [10]. 
So, let us consider an external electric field applied between two electrodes and 
assume that a quasi-neutral plasma constituted of ions and electrons is emitted 
from the cathode. This plasma undergoes a thermal expansion towards the anode 
while the electrons are emitted from the plasma-vacuum interface according to a 
Child-Langmuir law [3, 7]. The gap between the interface and the anode is de- 
creasing with time and the extracted electron current is simultaneously increasing 
in accordance with the Child-Langmuir law. 

Our starting point is a one-dimensional Euler model for each species (ions and 
electrons), coupled with the Poisson equation. However, due to the large plasma 
densities, numerical simulations of this model are very expensive in practice. So, 
we rather investigate a one-fluid Euler model obtained as a formal quasi-neutral 
limit of the initial two-fluids model. Moreover, during the thermal expansion of 
the plasma, electrons are emitted from the plasma- vacuum interface to the anode. 
The quasi-neutral model is unable to describe their motion between the plasma- 
vacuum interface and the anode but by means of a rescaling of the system, we show 
that this emission is well modelled by a Child-Langmuir law. Then, the boundary 
conditions at the interface X(t), which connect the two models, are determined 
through two additional assumptions validated by numerical simulations. Namely, 
these assumptions are: 
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i) The electron emission is governed by the Child-Langmuir law with a ma- 
ximal extracted electron current. 

ii) Electron acceleration in vacuum produces a reaction pressure which acts 
on the plasma-vacuum interface and slows its motion down. 

A similar asymptotic analysis is performed in [4] for the isentropic Euler 
system. Here we extend it to the full Euler system. We detail assumptions i) and 
ii) and numerically justify them by comparing simulations obtained on the one 
hand with the two-fluids Euler system and on the other hand with the quasi- 
neutral model. 



2. Two-fluids Euler-Poisson model 



Let us consider a pair of electrodes: the cathode is located at x = 0 and the anode 
at x = L. A plasma constituted of ions and electrons is emitted from the cathode. 
Let i and e be the index for ions and electrons. Then, are the masses of the 
particles, 7V^ e their densities, C/^ e their velocities and T^ e their temperatures. The 
total energies Wi^ e are defined by 



W P 



7-1 



Pe + 



m. 



n k ul 



Wi = 



1 „ , mjNjUf 

7 - 1 1 + 2 



where 7 = 5/3 and the pressure laws are given by P l e = A r J t kg T t e , kg being 
the Boltzmann constant. We start with the Euler system for each species, coupled 
with the Poisson equation. Thus, the system for ions is given by: 



' N i{ + (NiUi) s = 0, 

< rm {(Ni Ui)i + ( Ni Ufh) + Pi- = - qNi 
k Wi- t + (Wi UiU + (Pi Ui) s = -qNi Ui 
while electrons are described by: 



' N e - + (N e U e ) s = 0, 

< m e ((N e U e ) { + (N e Ui) s ) +P es =qN e 
w w e{ + ( W e U e ) £ + (P e U e ) s = qN e U e 



where q > 0 is the elementary charge. 

The Poisson equation is used to take into account the evolution of the electric 
field, so that: 

= — (Ni - N e ), 

£0 

where $ is the electric potential and £o is the vacuum permittivity. 

We assume that a quasi- neutral plasma is emitted at x = 0, with the same 
velocities and temperatures for the two species. So, the boundary conditions for 
the plasma are given by: Ni(x = 0) = Af e (0) = TVo, Ui( 0) = U e ( 0) = C/ 0 , Ti(0) = 
T e (0) = T 0 , and the ones for the electric potential are: <f>(0) = 0, 4>(T) = $l- 
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3. Quasi-neutral limit in the plasma region 



Now, in order to get a quasi-neutral model, we scale these equations with the fol- 
lowing characteristic sizes: the gap between the two electrodes L, the thermal ionic 
velocity U t h = \/To/rai, the time r = L/U t h , the density of the emitted plasma 
No , the temperature at the cathode To, the anode potential and the internal 

energy at the cathode Wo = NoTq. This leads us to introduce some parameters: 

_ . m e 

- the mass ratio: e = — , 

m l 

- the ratio of the internal energy at the cathode to the applied potential 

T 

energy: 77 = —— « 1, 

Q®l 

- the ratio of the applied potential energy to the coulombian interaction 

. £o 

enerEy: A = 

- the ratio of the emission velocity at the cathode to the thermal ionic 

U 2 

velocity: a = — 

"th 

Next, writing the Euler-Poisson system with the scaled variables: x, £, </>, n^ e , 
Wi,e, and p iiG where x = Lx, t = rt, $ = N iiC = N 0 n iie , U^ e = U 0 u i: e , 
Wi, e = WoWi, e and P i>e = iV 0 T 0 pi, e , we get: 

( n it + (riiUi) x = 0, 

Tli 4 *x 



(rii Ui)t + rq )x T Pi x — 



V 

Tli Ui (f) x 



(i) 



W t t + (Wi Ui + Pi Ui) x = - 

•I 

( 77-g^. T (n e Ue)x — 0? 

e {(n e u e )t + {n e u 2 e ) x ) + p ex = 



Tie 4*x 



Tl e U e (f) x 



with 



Wj 



W et + (W e U e +p e ^e) a 

'I 

A 0 xx — n e , 

1 1 O 1 

r Pi + x rii u { , w e = - 

7 - 1 2 7 - 1 



Pe + ^ n e U e> 



and the boundary conditions are given by: 

f ni(x = 0) = n e (0) = 1, Ui( 0 ) = u e ( 0 ) = y/a, 

1 a\ _ ( 1 



Wi( 0) = 



a 

7-T + 2 



w e (0) = 



7- 1 + T" 



( 2 ) 

(3) 

(4) 

(5) 



l Pi(0)=p e (0) = l, 0(0) = 0, 0(1) = 1. 

We are now interested in the limit 77 — > 0 with A = 0(1) and a = 0(1). These 
values agree with the ones computed in some high current diodes [10]. Since the 
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forthcoming analysis is independent of whether e is assumed to be small or not, 
we assume 6 = 0(1). 

So, let X(t) be the plasma- vacuum interface position at time t. Then, when 
77 goes to 0, we formally obtain that n e — > n, n* — » n, u e — > u, Ui — > u and 
(wi + w e ) — > w. Furthermore, n, 7/ and w, the density, the velocity and the total 
energy of the quasi- neutral fluid, satisfy: 



' n t + (nu) x = 0, 

(l+e) ((nu) t + (nu 2 ) x )+p x = 0 , 
w t + ( wu + pu) x = 0, 

, <t>(x) = 0 , 

for all x £ [ 0 ,X(t)}, where 



1 . x nu 2 

w= -p+(l + e)-— and p = Pi + p e , 

7 — 1 l 



with the following boundary conditions: 

( n(x = 0) = 1, 



< 




u(0) = 
a {l + e) 
2 



( 6 ) 



Let us notice that only the total pressure p = pi + p e is given. To compute each 
pressure term, p e for example, we need to solve the additional non-conservative 
equation (see [1] for more details): 



p ef T U e p e x T 7 Pe U e x = 0. 

Moreover, the quasi-neutral model (6) deduced from ( 1 , 5 ) in the limit 77 — > 0 , is 
valid only when the density n is strictly positive. A rigorous proof of this limit was 
given in [ 2 ] assuming the density strictly greater than a positive constant. The same 
condition appears for the quasi- neutral limit of the drift-diffusion model in [8]. So, 
the quasi- neutral model (6) is no more valid close to the interface X ( t ) where ions 
and electrons are moving apart. Therefore, a different asymptotic regime must be 
investigated in order to describe the behavior of the electrons in the region [X(t), 1 ] 
(the “beam”). 



4 . Child-Langmuir limit in the beam region 

Under the action of the anode potential, the electrons are accelerated while the 
ions are slowed down. Beyond the interface X(t), there are no ions. The electrons 
are described by equations ( 2 , 3 ) with rii — 0 . Due to the external electric field, the 
electron velocity is increasing to reach a value of the order of {er\)~ x l 2 . In order to 
investigate the behavior of the electrons in the beam, we must rescale the electron 
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velocity. So for all x £ [X(t), 1], we set u e = u e /y/erj. This scaling transforms the 
system (2, 3) into 



n e . — (n e — 0, 

y/erj 

( 1 x 1 , _ 2x \ n e 4> x 

£ —p= ( n e u e ) t + — ( n e u e ) x \ + p e = 

\y/erj srj J rj 



’h/H . 



l A $ xx = Tl ei 



/y ijj \ — 

(n e uf) t + — — - (Pe U e ) x + - (he ul) x = he U e <f> x , 

y 1 a 



with the boundary conditions cf>(X(t)) = 0, 0(1) = 1 and u e (X(t)) = 0. 

When rj goes to 0, we formally get a stationary model (which only depends 
on time through the interface position: X(t)) such that for all x £ [X(t), 1]: 

(n e Uq) x = 0, 

(jl e U e ^ x = We 0a:? 

(7) 

— ( Tl e U^x = Tl e U e (fixi 
K ^ <fixx = Tl e . 

Note that the third equation can be easily deduced from the other ones. This 
system is known as the Child-Langmuir model (see [3, 7]). Its solution depends on 
a free parameter, the current j — n e u e , where j ranges in the interval [0, jczj- The 
maximal value of j, called the Child-Langmuir current, is given by the relation 

4^2A 

kl = 

and is associated with the additional boundary condition (fi x (X(t)) = 0. We shall 
assume that the emitted current coincides with the Child-Langmuir current, and 
shall verify numerically that this hypothesis seems fulfilled. 



5. A model for the plasma-beam interface 

Now, we have to connect the quasi-neutral model (6) and the Child-Langmuir 
model (7). The boundary conditions at X(t) result from the dynamics of the par- 
ticles confined about the interface. A formal asymptotic analysis will be developed 
in a future work (see [5]). In the present work, we shall rely on some hypotheses. 
We set that the total momentum is preserved while the electrons are crossing the 
interface. The acceleration of the electrons towards the anode generates a reaction 
pressure which slows down the motion of the plasma- vacuum interface. 

So, let us determine this pressure term. Let a = dX/dt be the interface 
velocity. We denote by [/] the jump of the function / across the interface, by /_ the 
limit of / on the left of X (t) and by /+ the one on the right. The Rankine-Hugoniot 
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relations applied to the conservation of the ion density across the interface leads 
to: 

[rii (ui - a)] = 0. (8) 

Similarly, using the total momentum equation, we have: 

A 



Ui u 2 + sn e u 2 e +p e +pi - \(j) x \ z ) - a ( riiUi + £n e u e ) 

2 rj 



= 0 . 



( 9 ) 



Now, from equations (1, 3), the conservation of total energy can be written as: 

{wi T W e ^t ~p ( Wi Ui -p Pi Ui “b VJ e U e -p p e U e ^ x — (0x)t 4*x ~ 7) (^x)t’ 

rj 2 rj 

and applying the Rankine-Hugoniot conditions, we get: 



(WiUi -p Pi Ui -p VJ e U e -p Pe Ue) ~ G ( Wi + W e + 7^-|0:z| 2 

2rj 



0. 



( 10 ) 



Moreover, we suppose that the quasi-neutral limit is still valid up to the interface 
X(t), such that: 

n e _ = n_, m_—n _, u x _ = u_, u e _ — (11) 



where n_ and denote the limit of the quasi- neutral quantities on the left of the 

interface. 

There are no ions beyond the interface, hence 



= 0, and Wi+ — 0. 



( 12 ) 



With the condition of maximal current given by the Child-Langmuir law and the 
quasi- neutrality of the plasma, we get: (j) x _ = </> x + = 0. Next, assuming that the 
electron density has no jump across the interface, we obtain: n e+ = n_. We have 
seen that the electron flow in the gap between X ( t ) and the anode is given by the 
Child-Langmuir current such that: jcL = jcL/y/^V = n e + (u e + — a). From (8) 
we deduce the interface velocity: 



dX 

dt 



= a = U-. 



(13) 



and (9) reduces to: 

£ [n e u e (u e - a)] + [p e + p { \ = 0. 

Therefore, since u e _ — U- — a, n e+ {u e + - a) = jcL , Pi+ = 0 and n e + = n_, we 
have: 

( jcL \ 

— +a\+p e+ . (14) 

Using Wi + = 0, Ui_ = a and <t> x - = <f>x+ = 0, (10) becomes 
[Pi Ui + w e u e + p e u e — a w e } = 0. 



Now from (11) and (12), we deduce: 

= Pi_Ui_ + Pe-U e _ = W e+ ( U e+ -a)+ Pe + U e+ , 
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then thanks to the definition of w e + and to the relation n e+ (u e + 
get 



p-u- =p e + 



7 Jcl 
7 — 1 n_ 



+ a 



£ . 

+ ~j3CL 



Jcl 



+ cr 



2 



- (?) =jcL , we 



(15) 



Solving the system (14) and (15), we determine p e+ and p_. The pressure term 
p e+ is given by: 



Pe + 



7-1 



n_ u 



JclX 
n - ) 



Therefore, substituting this expression into (14) , we can deduce the momentum 
flux p_ in the plasma resulting from the electronic emission at the interface and 
finally the energy flux p_u_. 



6. Numerical validation of the asymptotic model 

This section is devoted to the numerical study of the quasi-neutral model for the 
plasma (6) coupled with the Child-Langmuir model (7) for the “beam” by the 
momentum and energy fluxes p- and p_u_. So, let us consider the quasi-neutral 
system (6) written in the conservative form (see [9], [11]): 

fr + = °’ y (°- x ) = °> ^M) = v Q (i6) 

where V = (n, (1 e)nu, w) T and F(V) = (rm, (1 + e)nu 2 + p, wu + pu) T . 
Let us consider a uniform grid constituted by N cells (rrik)k=i,--- ,n of size Ax. We 
denote by At the time step and we set t n = nAt for all n > 0. Then for n > 0 
and k E (1, - * - , N}, V£ is an approximation of V at time t n on the cell rrik and 
X n is the approximation of the plasma- vacuum interface position X at time t n . 
Let us denote by rrik 0 (fco ^ {1, • • • , N }) the cell such that X n E rrik Q . Using the 
evolution equation of the interface velocity (13), we get: X n+1 = V n + A tv% . The 
system (16) is discretized with a classical finite volume method (see [11]). So, we 
describe the scheme only in the last cell occupied by the plasma, i.e. mfe 0 . 

Using the Green formula and taking into account the interface motion, we 

have 

(X " +1 - X k0 _ l/2 ) V-(X n - X k0 _ 1/2 ) v k n 0 = At (gl_ 1/2 - gi) 

where 9^ Q _ 1 / 2 is the numerical flux through the interface between the cells rrik 0 - 1 
and rrik 0 at time t n . It is computed using classical schemes like the HLLE scheme or 
the “polynomial” upwind scheme (see [6], [11]). Moreover, g ™ is the numerical flux 
through the interface, we set g™ — (0, (p-) n , (p_u_) n ) T . Finally if X n+1 E m^, 
that is if the plasma-vacuum interface has not left the cell rrik 0 during the time 
step, we set Vj^ +1 = V, else if X n+l E nrik 0 +i we set V^ +1 = = V. 

We compare the simulations given by the two-fluids and the asymptotic mod- 
els. We use Ax = 2.1(T 4 , 7 - 5/3, e = 5.KT 1 , rj = 10“ 4 , A = 10“ 3 and a = 1. 
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Figures 1 and 2 show the analysis of the plasma between the cathode and the inter- 
face X(t). Contrary to a classical fluid expansion in vacuum, in our case the ionic 
fluid is slowing down. The interface velocity given by the asymptotic model seems 
to be correct in a qualitative way. The values numerically used for the momentum 
and energy fluxes and p-u _ are here: (3p _ and (3p-U _. The parameter (3 has 

to be adjusted according to the two-fluids simulations. Here, (3 — 0.5; note that 
without this correction the interface velocity given by the quasi-neutral model is 
a little lower than the one of the two-fluids model, but its order of magnitude is 
correct. 

Figure 3 shows the electron dynamics in the beam. Plotting the electron 
velocity, we can see that the acceleration of electrons from the interface to the 
anode is well approximated by the Child-Langmuir law. 

Figure 4 gives the electric potential computed from the Poisson equation on 
the one hand and by the Child-Langmuir model on the other hand. We can observe 
that the potential is equal to zero between the cathode and the interface, which 
justifies the quasi-neutral approach. We also show ( right picture) that considering 
a constant Child-Langmuir current in the gap between X(t ) and the anode is a 
good approximation. 
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Figure 1 . Densities and speeds of the ionic fluid given by the 2- 
fluids model compared to those given by the quasi-neutral model: 
values observed between the cathode x = 0 and the interface at 
times t = 0.04r, t = 0.08r, t = 0.12r. 
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the 2-fluids model (right picture) 
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Towards the Thermodynamic Modeling 
of Nucleation and Growth 
of Liquid Droplets in Single Crystals 

Wolfgang Dreyer and Frank Duderstadt 



Abstract. Stress assisted diffusion in single crystal Gallium Arsenide (GaAs) 
leads to the formation and growth of unwanted liquid arsenic droplets in a 
solid matrix. This process happens during the heat treatment of single crystal 
GaAs, which is needed for its application in opto-electronic devices, and it is 
of crucial importance to pose and answer the question if the appearance of 
droplets can be avoided. To this end we start a thermodynamic simulation of 
this process. Special emphasis is given to the influence of mechanical effects 
on chemistry, diffusion and interface motion in GaAs. 



1. Introduction of the complete problem 

Stress assisted diffusion in single crystal Gallium Arsenide (GaAs) leads to the for- 
mation and growth of unwanted liquid droplets in a solid surrounding, [14], [15], 
[4], [5]. This is an elaborate thermodynamic process involving chemical reactions, 
diffusion, motion of phase boundaries, and all these phenomena are strongly influ- 
enced by local mechanical stress fields. The resulting model consists of a coupled 
system of algebraic equations, partial differential equations and conditions at the 
appearing liquid/solid interfaces. The variables are the mole fractions of various 
constituents, which appear in semi-conducting GaAs, the mechanical strain, and 
a phase density, indicating the size distribution of droplets. 

Single crystal GaAs contains the major elements gallium and arsen, and additional- 
ly various trace elements, which are of most importance in order to fabricate semi- 
conducting or semi-insolating GaAs, respectively. All constituents occupy three 
sublattices of face-centered cubic (fee) symmetry. The sublattices are indicated by 
three greek letters: a , /?, 7 . The lattice a is dominantly occupied by gallium, while 
the arsenic is the major substance on the lattice sites of sublattice (3. The sublattice 
7 indicates an interstitial lattice, where the dominant elements are vacancies. For 
a proper application of single crystal semi-conducting or semi-insolating GaAs as a 
wafer material, it is crucial that the trace elements are distributed homogeneously 
on the lattice sites of the three sublattices. In order to remove dislocations, which 
appear during the process of crystal growth, a special heat treatment of the wafer 
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becomes necessary. However, the heat treatment might start a diffusion process, 
where the trace elements move preferably towards the proximity of the dislocations. 
There the trace elements find thermodynamic conditions so that the formation of 
liquid droplets, dominantly formed by the arsenic, sets in. However, the arsenic 
precipitates appear also in regions free from dislocations. 

The first Figure shows two graphs, which represent the morphology of single 
crystal Gallium Arsenide on the rara-scale. 



A 



* 

The graph on the left hand side is a photo-luminescence mapping, and shows 
the distribution of dislocations. The cellular structure is typical for Gallium Ar- 
senide: the dislocations are dominantly arranged in rings, with an inner region, 
which is free of dislocations. The graph on the right hand side of the Figure shows 
the distribution of droplets. A careful study of this graph reveals that large droplets 
appear in the cell wall, i.e. in the vicinity of the dislocations. Adjacent to this ring 
of droplets, there is a region without droplets, and the center of the dislocation ring 
is occupied by small droplets. The number densities of small and large droplets 
are of the same order. 

We are thus confronted with the problem to describe the thermodynamic nucle- 
ation and growth conditions of liquid droplets within a solid matrix. In particular, 
we will study the influence of local mechanical stresses on chemistry, diffusion and 
interface motion. Stress fields arise due to dislocations, eigenstrains due to point 
defects and due to misfit strain. The latter describes the phenomenon that the 
arsenic liquid phase, which appears as inclusions, has a larger specific volume than 
the solid phase. 





2. Some physical and chemical properties of 
semi-insulating/ semi-conducting GaAs 

2.1. The phase diagram 

The phase diagram of GaAs, which is shown in the following Figure, indicates the 
various phases and their boundaries. Below the liquidus line, we find, left and right 
of the stoichiometric line, i.e. (arsenic) composition y = 0.5, two regions where the 
solid GaAs crystal may coexist with the liquid. Right and left from y = 0.5, the 
crystal is As-rich and Ga-rich, respectively. These regions are bounded from below 
by the two eutectic lines at 29.5°C and 810° (7, respectively. The two regions below 
the eutectic lines indicate mixtures consisting of the solid GaAs crystal as matrix 
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material, and containing solid Ga-rich and As-rich precipitates, respectively. In 
this study, we are exclusively concerned with thermodynamic processes in the 
two-phase region right from the stoichiometric line and above the 810°C eutectic. 




The GaAs crystal at the stoichiometric composition consists of two fee sublat- 
tices that are displaced along the diagonal of the elementary cube, i.e. the crystal 
has the symmetry of Zinkblende. However, the ideal crystal at y = 0.5 is neither a 
semi-conductor nor a semi-insulator. The latter one may form if the crystal com- 
position is slightly on the As-rich side. Moreover, there are further constituents 
appearing in GaAs. These are Oxygen (O), Silicon (Si), Bor (B) and Carbon (C) 
in very small quantities, but nevertheless these trace elements induce very im- 
portant phenomena. Above the 810 ° C eutectic there is a region, which is called 
region of homogeneity, where the excess amount of As and the aforementioned 
trace elements behave like point defects that are homogenously distributed over 
the two already mentioned sublattices and on a further sublattice, which is formed 
by interstitial lattice sites. 

The next Figure shows the region of homogeneity ( H ) as it was calculated on 
demand for Freiberg Compound Materials (FCM) by H. Wenzl and K.Mika -for 
details of the underlying model, see [15]. During a heat treatment of the crystal, 
the boundary of the region H may be crossed, as it is indicated in the Figure. If 
this happens, unwanted liquid As-rich precipitates appear. Thus the knowledge of 
the boundary of the region H is of crucial importance for the proper application 
of GaAs as a wafer material. 
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The calculated region of homogeneity depends sensitively on the assumed chem- 
istry, i.e. on the chemical reactions which are taken into account. 
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One of the objectives of this study is the determination of the mechanical influence 
on the region of homogeneity. The mechanical influence arises due eigenstrains, 
which are induced by point defects and dislocations, and due to misfit strain, which 
is induced by a volume change during the solid/liquid phase transition. Regarding 
the non-mechanical parts of the thermodynamic theory, we rely on the chemical 
model given by Wenzl et. al., [15], in collaboration with the wafer manufacturer 
FCM, and which will be discussed in detail in the following Section. 

2.2. The chemical constitution of semi-insulating As-rich GaAs 

The chemical model of Wenzl et. al. [15] is suited to describe the behaviour of 
semi-insulating As-rich GaAs that is fabricated by Freiberg Compound Materials 
(FCM). 

The GaAs crystal is assumed to consist of three sublattices, indexed by {a,/?, 7 }. 
Among them there are the already introduced two fee sublattices, which are mainly 
occupied by Ga- and As-atoms, respectively, and the fee interstitial sublattice, 
whose lattice sites are mostly occupied by vacancies. 




The above Figure gives a sketch of the elementary cell of the crystal. The lattice 
sites at the corners and faces of the cube form the sublattice a, which is mainly 
occupied by Ga atoms. In the interior of the elementary cell of the sublattice a 
there are further lattice sites, viz. four octahedral and eight tetrahedral lattice 
sites. Four lattice sites of the tetrahedral sites form the elementary cell of the (3 
sublattice, indicated in the Figure by light balls. The remaining four tetrahedral 
sites form the sublattice 7 . It is important to know that the available space of 
a lattice site of the (3 sublattice is equal to the corresponding space in the 7 
sublattice. The available space for octahedral sites is too small for an occupation 
by atoms, and thus the octahedral lattice remains empty. 

The number densities of sublattice sites are denoted by n Q , np, n 7 . There holds 
n a = np = n 7 = usl • The free electrons and free holes do not need lattice sites, 
also in regions, which are free of dislocations, rather they form the electronic gas. 
The constituents of the GaAs crystal are the chemical elements Ga, As , O, Si, 
B , C, and vacancies V, free electrons e and holes h, which are also treated like 
chemical elements. For a proper description of semi-conduct ing/semi-insulating 
GaAs, the list of constituents must be extended because the elements A s, O, Si, 
C and the vacancies V may also occur as charge carriers, which is indicated by a 
superscript. In addition, when the constituents occur on different sublattices they 
are considered as different constituents, and this is indicated by a subscript. There 
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are thus 17 constituents: free electrons and holes, which do not occupy lattice sites, 
and 15 constituents on the three sublattices according to the following table. 
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lattices which determine the symmetry of the crystal. The other constituents are 
called (point) defects, which thus consist of antisite atoms, interstitial atoms, trace 
elements and vacancies. 

The special choice of the constituents, and in particular the assumption that Ga is 
exclusively without charge on the sublattice <a, has been suggested by FCM and is 
called Freiberger model. The underlying reasoning of the Freiberger model relies on 
experimental hints, and in particular on the objective to calculate a quite realistic 
homogeneity region that fits to the properties of FCM made semi-insulating GaAs. 
For different choices and resulting consequences we refer the reader to [15]. 



3. A list of simplified problems with increasing complexity 

The last section has revealed that we have a quite complex thermochemical process 
under consideration. For this reason, it is appropriate to consider at first only very 
simple initial and boundary value problems that serve especially to design the 
mathematical model and to calibrate its parameters. 

In particular, we consider the two situations sketched in the following Figure. 
The two pressure vessels, which are closed by a movable piston, set two thermody- 
namic systems with prescribed outer pressure po and temperature To. The inner 
temperature is assumed to be homogeneous and equal to the outer temperature. 




The example on the left hand side considers a liquid and a solid phase sep- 
arated by a planar interface. This example is designed to study the influence of 
mechanical stresses on the chemical and interfacial equilibrium condition and on 
the diffusion process in the solid. 

In particular, the example on the right hand side considers a solid ball on fim 
scale in contact with a gas consisting of the four arsenic constituents As , As 2 , 
As%, As 4 . The solid ball contains at its center a liquid droplet, made exclusively 
from As and Ga , that can grow or shrink. The outer interface of the ball is in 
equilibrium with the gas, and we consider the case that only As 2 may cross the 
interface and become an interstitial As in the solid. 
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4. Thermodynamics of mixtures, Part 1: 

Variables 

We consider a body B whose volume V(t) may depend on time t. A space point, 
which is at some time t occupied by a material point P of B, is indicated by its 
cartesian coordinates x = (x z ) = (x 1 , x 2 , x 3 ). The notion of material point means 
the smallest units of B that cannot be further resolved on the chosen space scale. 
The body B may consist of coexisting fluid and solid phases, which are both 
mixtures of various chemically different constituents. 

The thermodynamic state of the fluid phase is described by the variables 

n a — n a (t, x) , a E {1, 2, ..., a^}, number densities of the constituents, 
v z = v l {t , x), barycentric velocity of the mixture, 

T = T(£), (absolute) temperature. (1) 

The total mass density of the mixture, p, is given by the sum over the number 
densities x molecular weights, M a , and for some purposes it is useful to calculate 
the mixture velocity, v, from the velocities of the constituents, v l a . Thus we write 

Q-L QL 

p = y^M a n a and v l = ^ ( 2 ) 

a= 1 a= 1 P 

In order to describe the deformation of the solid phase we must at first introduce 
a reference configuration. To this end, we indicate at a fixed reference time to any 
material point P of the solid phase by its cartesian coordinates X — (A 1 , X 2 , X 3 ). 
The motion of P is described by the field X) = (y 1 ^, A), y 2 (t, A), y 3 (t, A)), 
which gives the coordinates of P at time t > to : 

x { = x% A). (3) 

The field y 2 is called motion , and its first derivatives 

v l — and with J = det(F^) ^ 0 (4) 

C/t uX^ 

are the velocity and the deformation gradient , respectively. The condition on the 
Jacobian J guarantees that (3) is invertible. 

It is well known that the Jacobian is related to the total mass density by 

( 5 ) 

where pR is the mass density in the reference configuration. 

The solid phase may consist of a mixture with as constituents, and thus the 
thermodynamic state of the solid phase is described by the variables 



= n a =n a (t,x), T — T(t). 



( 6 ) 
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5. Thermodynamics of mixtures, Part 2: 
General Constitutive Model 



We consider a GaAs crystal consisting of two coexisting phases, which are formed 
by a nonviscous mixture of liquids and a thermoelastic solid mixture, respectively. 
There are eigenstrains due to the presence of point defects, misfit strain and dis- 
locations. However, dislocations are not considered in the current paper. 

Beyond the variables, further important quantities are needed to describe the prop- 
erties of mixtures. These are: the specific internal energy, u , the specific entropy, s, 
the specific free energy, ip = u—Ts, and the chemical potentials of the constituents, 
Pa, [10], [11]. 

Furthermore, there are three measures of stress, viz. the Cauchy stress, a lk , and 
the first and second Piola Kirchhoff stress, respectively, a lk and t lk : 

-l -l -l 

(j lk = a k \ a ik = JF lj a jk , t ik = JF ij F kl a j \ (7) 

-l 

where F lj are the components of the tensor (F -1 ) T . The Cauchy stress appears 
naturally in a thermodynamic description where the reference configuration of 
the solid phase does not explicitly appear. The first Piola Kirchhoff stress is a 
simply measurable quantity, and appears naturally in the description that relies 
on the reference configuration. Finally, the second Piola Kirchhoff stress is needed 
to formulate constitutive laws relating the stress to the variables, see [13]. 

Let us decompose the Cauchy stress into its trace and the traceless part, the latter 
one being indicated by angle brackets around the indices; for example we write 

a ik = _ p5 rk + a <ik> with p = 1 nn ( 8 ) 

o 

The quantity p is called pressure. 

There are also various measures of the deformation of a body. The most important 
one is the right Cauchy Green tensor, C lk , and its unimodular restriction, which 
we denote by c lk : 

C ik = pmipmk and pk = j- 2 /Zfjik SQ that det ( c *fc) = \ (9) 



All these quantities are related to each other by the Gibbs equation, which reads 
in fluids as well as in thermoelastic solids 



1 



dpip = —psdT + -J~ l /h ik dc ik + Ua dn a ■ 

a 

The Gibbs equation implies 

Furthermore, there holds the Gibbs/Duhem equation 

pip + p = ^p a n a - 

a= 1 



( 10 ) 



(ii) 



( 12 ) 
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For the solid phase the knowledge of the free energy density p s if s (T^ ni,...,n as , c lk ) 
is sufficient to relate the quantities we have introduced to the variables. 

In a mixture of isotropic fluids we have c lk = 5 lk , and the second Piola/Kirchhoff 
stress drops out of the Gibbs equation. Thus for the liquid phase the knowledge of 
the free energy density Pl^l{T, n\ ,...,n aL ) is sufficient to relate the aforementioned 
quantities to the variables. 

The proof of the Gibbs equation for fluid mixtures and for the solids that only 
consist of a single substance can be found in [10]. The generalization to a solid 
mixture, viz. eqn. (10), has not been published elsewhere, but it can be derived 
following the strategies that are outlined by I. Muller in [10]. 



6. Thermodynamics of mixtures, Part 3: 

Explicit representations of free energies, chemical potentials, 
pressure and stresses 

In general, there are three competing additive contributions to the free energy 
of a mixture: (i) entropy, (ii) interaction or mixing energy, and (iii) mechanical 
energy. If only the entropic contribution is considered, the mixture is called an 
ideal mixture. In the sequel, we will combine the two first contributions in the so 
called chemical part of the free energy, and we write 

f»p = f»p chem + fnp mech and accordingly y a = y c a hem + y™ ech . (13) 

6.1. Chemical potentials and pressure of the liquid phase 

For the sake of simplicity, we assume that the liquid phase only consists of Ga 
and As. We give at once the explicit constitutive laws for the chemical potentials 
and the pressure. Then, the free energy density can be calculated by means of the 
Gibbs/Duhem equation (12). 

/4r = &a(T) + RT Log(l -y) + (Lo + L x (3 - 4 y))y 2 , 

r = AsV) + RTLog(y) + (Lo + L,(l - 4y))(l - yf . (14) 

The newly introduced quantity y = n^ s /n is the arsenic mole fraction. In both 
the above formulae, the first terms give the standard chemical potentials. The 
second terms are the entropic contributions, and the third terms give the mixing 
energy according to the Redlich/Kister model, L 0 ,Li measuring the strength of 
the mixing energy. R is the universal gas constant. For details, see [12]. 

The mechanical parts of the chemical potentials are nonclassical contributions. 
They describe the interfacial misfit , which arises because the transition from the 
solid phase to the liquid phase is accompanied by a change of the specific volume. 
In fact, arsenic rich liquid needs more space than solid GaAs. In order to describe 
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this phenomenon, let us at first introduce the function 



ptherm(J',y} 

m M As y 

As{ MasV + M Ga (l ■ 



■ v ) 



(15) 



where va s (T) and vc a {T) are the specific volumes of pure As and pure Ga, re- 
spectively. Next we choose as a reference density of the liquid p# = p therm (TR , 1), 
and we set p*(T) = p t s herm (T), where p t ^ ierrn is the density of stoichiometric solid 
GaAs which can be read off from a table. Finally, we measure the interfacial misfit 
by 



P*(T) ~ PR 
PR 

There result the mechanical chemical potentials 



(16) 



mST* = —K L Log 



and 



(17) 



Pr \P*(T)j ’ "" Pr 

which rely on the simple constitutive law for the liquid pressure: 

' P — P*(T) \ 



P = Pr + K l 



Pr 



I' 



(18) 



Recall that the liquid density is denoted by p, while K l is the bulk modulus of 
the liquid. 



6.2. The free energy of the solid phase 

The solid phase of GaAs consists of three fee sublattices a, /?, 7 which are occupied 
by seventeen different constituents according the description of Section 2.2. Ho- 
wever, there are only fifteen unknown number densities, because the trace elements 
Si, C and B are prescribed. 

The chemical part of the free energy density of the solid phase is given by 



pi)' 



chem 



= ^n a p°(T) + RTY/ta log( — ) + IY. I°b(T)n a — - 
£i ^ USL 2 a)b=l nSL 



(19) 



The free energy density p ^ chevn consists of three contributions, which are the 
standard chemical potential pP a (T), consisting of the energy and entropy constants 
(first term), entropy (second term) and mixing energy (third term). The constant 
q , (= 4 in the current study), is the coordination number and the temperature 
dependent quantities I a b denote the interaction energies. We take only nearest 
neighbour interactions into account. Furthermore, we assume that electrons, holes 
and the other charged constituents have only entropic free energies, which is a fairly 
good approximation. Examples for numerical values for the nonzero interaction 
energies can be found in [14] and [15]. 
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The free energy density p^ mec/l describes elastic energy including eigenstrains and 
it is given by 

pi ,mech = jL(j2/3 c « _ C ^)K^ 1 (J 2/3 C*' - Cf) - p Ry Jdet(d»)\ det(c . j ) = 1 . (20) 

K lJ kl is the stiffness matrix and Cl 3 is a measure for the eigenstrains. The form 
of p 2 p mech has been chosen in such a way that the Gibbs equation implies Hookes 
law for the second Piola/Kirchhoff stress, viz. 



i -i 

fj = _ K vkl(j2l3 c kl _ C klj _ pR jl/3 c kz ( 21 ) 

The matrix C* 1 describes the various eigenstrains due to point defects, see the 
next Section for details. 

Next we calculate from (19) and (20) the chemical and mechanical parts of the 
chemical potentials. There results 



chem 

r^a 



= ^ a {T) + RT\og(^-) 
\nsL J 



17 



17 



6=1 a , 6=1 



n a rib 



n 



( 22 ) 



SL 



for the chemical part of the chemical potential, while its mechanical part reads 

^mech = (l(J 2 / 3 c^ - C?) - C?) R ijkl (J 2/Z C kl - Cf). (23) 



7. Thermodynamics of mixtures, Part 4: 

Eigenstrains due to point defects 

There are three different mechanisms that give rise to eigenstrains, which may 
appear due to (i) dislocations, (ii) point defects, and (iii) interfacial misfit. 

The role of eigenstrains due to dislocations have been discussed in the introduction 
and will not considered anymore in the current study. 

The eigenstrains due to interfacial misfit can be attributed either to the liquid 
phase or to the solid phase, see for example [7]. In the present study we have 
chosen the liquid phase in order to incorporate the interfacial misfit, see Section 

6 . 1 . 

There remain to discuss eigenstrains due to the various point defects on lattice 
sites of the two sublattices a and /3, viz. Asa \ V^, V^~\ B& \ Si ^ on 

sublattice a , 0^\ 0^~\ 0^\ C ^ on sublattice (3. Note that the Freiberger model 

does not lead to eigenstrains on sublattice 7 , because the defect As^ needs the 
same space as an As atom on the sublattice (3. Uncharged vacancies on interstitials 
have no effect at all. The defects on sublattices a and (3 induce eigenstrains, which 
are different for the neutral and the charged defects. The neutral defects except 
the vacancies can be fairly well approximated as elastic balls, whereas the charged 
defects and the vacancies behave as they were rigid balls. 




Nucleation and Growth of Liquid Droplets in Single Crystals 



123 



Let us decompose the function Cl 3 as a product of thermal eigenstrains and eigen- 
strains due to defects: 

C?=C% erm C% fecte . (24) 

The thermal part is obviously given by 

Ci k herm = (l + a(T-T R )) 2 5 ik , (25) 

where a denotes the coefficient of thermal expansion. 

A simple calculation relying on elastic and rigid balls yields explicit expressions 
for the corresponding eigenstrains, which can be written as a product of elastic 
and rigid eigenstrains: 



rikj 

defects 






3 n S L< 



(26) 



for a g {As^\B^,0^} and b g {Asi +) , vj 0) , vi 3_) , S*L +) , 0 ( ~\ 0 ( ^\ cff\ V^}. 
The constants Z a and Zb are given by 

-5a 



= 



Ajh 

and Zb = — 



(27) 



1 + 8 a 

fi and K are the shear and bulk modulus of the solid phase, and 

8 a = {-0.055, -0.119, -0.317} (28) 

8 b = {-0.119, -0.119, -0.119, -0.119, 0.479, -0.317, -0.317, -0.317}. (29) 



The difference of the two representations for eigenstrains arises from the fact that 
elastic eigenstrains are characterized by continuous normal stresses at the interface 
between an elastic ball and the solid phase, while rigid eigenstrains are due to a 
prescribed normal displacement at the interface between a rigid ball and the solid 
phase. 



8. Thermodynamics of mixtures, Part 5: 
Field equations 



The variables of the thermodynamic model are the five components of the strain 
field and the fourteen number densities of the constituents in the solid phase, while 
in the liquid phase the only variables are the two number densities of Ga and As, 
because for simplicity we do not allow the other constituents of the solid to cross 
the interface. 

The field equations for the variables rely on several assumptions: 

(i) Mechanical equilibrium is reached immediately, so that the strain field is de- 
termined by the quasistatic momentum balance for the given number densities. 



da lk 

dx k 



-0. 



(30) 
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(ii) Local chemical equilibrium in the solid phase is reached much faster than 
diffusional equilibrium, so that for the given total number densities of Ga and As, 
the other number densities are determined by the 14 nonlinear algebraic equations 
representing the laws of mass actions. These equations will be derived in Section 
9 and are given by (40), (41), and (44)-(49). They have the generic form 

F A {ni, n 2 , ...n u \ T, n% a , n s As ) = 0, A e {1, ..14}. (31) 

We write the solution of these equations as 

n a = h a (T,n% a ,n s As ). (32) 

(iii) Thus only a single diffusion equation in the solid is needed. Furthermore, this 
equation simplifies due to the fact that only the mobility of the interstitial arsenic 
is important. The diffusion equation reads 

-^ = MA(A M », - AoaJ, (33) 

where M denotes the mobility of the interstitial A.sl, 0) and n s As denotes the total 
number density of the As content of the solid. fi As (o) and fiGa a are the chemical 

potentials of the indicated substances. However, the equations (32) has been used 
to eliminate those constituents that are in thermal equilibrium with riQ a ,n As , and 
this procedure is indicated by the tilde. The exploitation of the diffusion equation 
is carried out in [4]. 

(iv) The liquid is assumed to be in equilibrium and its number densities can be 
read off from the liquidus line of the phase diagram. 

(v) The form of the interfacial jump conditions depends on the interfacial mobili- 
ties. If the interfacial mobilities are much larger than the bulk mobility of As^°\ 
local equilibrium at the interface can be assumed, and the jump conditions are the 
ones that will be derived in the next two sections, see equations (43)2, (51)and (52). 
However, currently the magnitudes of the interfacial mobilities are not known, and 
it may happen that they have the same order as the bulk mobility. In this case the 
interfacial jump conditions result from the Becker/Doring theory of nucleation, 
see [5] for details. 



9. Thermodynamics of mixtures, Part 6: 

The Global Laws of Energy and Entropy 

In this Section we derive the equilibrium conditions for two coexisting phases. We 
consider the systems from Section 3 consisting of a container, which is closed by 
a piston. At time t , the volume V(t) includes either the liquid/solid or the liquid- 
droplet /solid/gas. We assume that the outer pressure po is given. Furthermore we 
prescribe the outer temperature To and assume that T 0 is constant throughout 
V(t). The dashed closed line indicates a control volume which we introduce in 
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order to apply the global balance laws of energy E and entropy S to the described 
system: 



dE dV • 

~dF +Po H- Q ' 



and 



dS > Q_ 
dt Tq 



(34) 



The quantity Q denotes the heat power, which may enter, Q > 0, or leave, Q < 0, 
the system, so that a constant temperature Tq is guaranteed. The equality sign 
holds in equilibrium, whereas in nonequilibrium, the variation of the entropy is 
greater than the ratio of supplied heat and temperature. This statement expresses 
Clausius’ version of the second law of thermodynamics, [2]. 

Elimination of the heat power leads to the main inequality 

dA 

— < 0 with the definition A = E — To 5 + poV. (35) 

The newly defined quantity, A, is called the available free energy or availability. We 
conclude that in dynamic processes that run at constant outer pressure, constant 
temperature and constant total mass, the availability must decrease and assumes 
its minimum at equilibrium. 

The availability contains the combination ^ = E — TqS denoting the Helmholtz 
free energy, whereas A = + poV give the Gibbs free energy G only if there is an 

overall constant pressure p = po in the interior of the volume V. However, we will 
not meet this situation in the two considered examples. 

The evaluation of the main inequality requires an explicit knowledge of the func- 
tional dependence of energy, entropy and volume on the variables that may change 
in the process under consideration. In the next sections we will introduce and dis- 
cuss the functions we need. 

The two systems considered in Section 3 consist of solid, S, liquid, L, and gas, G, 
phases, which occupy different regions, so that the total volume at time t is given 

by 

V(t) = V s (t) + VlC 0, V(t) = V s (t) + V L (t) + V G (t), (36) 

for the left and right example, respectively. The total free energies of the two 
examples can be additively decomposed into the free energies of the phases and 
the surface free energy of the interface boundaries 



® = *s(t) + ^ L (t) + 9 G (t) + gO l . (37) 

The surface energy, which is also called surface tension, is denoted by a and the 
liquid/solid interface area is Ol- 

The free energies of the phases can be represented by the volume integrals 







Vs 





( 38 ) 



where ip is the specific free energy and p is the mass density of the mixture. 
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10. The laws of mass action and interface conditions 

We consider the first problem from Section 3, where a liquid phase is connected to 
a solid phase by a plane interface. We assume that both phases are in equilibrium 
and we pose the problem to determine the corresponding equilibrium conditions 
by exploiting the minimum condition for the availability. 

Thus we determine the minimum of the function 

A = ps'tpsVs + pl^lVl + Po(V s + V L ) (39) 

under several side conditions. 

Conservation of atoms 

Ngcl = nca a Vs + riQ a V L , 

Nas — ( n As(3 + n Asi 0) n As ( + } n As^ 0) ^ S n As^L^ 

No — ( n o ( 0 ) + n 0 i-) 4- n 0 (+))Vs, N Si = n Si (+)Vs, 

Nb = n (o)Vs, Nc = n r (-)Vs • (40) 

Dot ^ R 



The three sublattices have the same number of lattice sites: 



TiAsp 4" n o ( 0 ) d - ) 4~ ^q(+) 4" r ^ / (j (-~ ) 

(' n G a a + n As ( 0 ) + n^^+j + n v ( 0) + 3n v <.s-) + n B m + n Si <+)) = 0 (41) 

riAsp + n v ( o) + n o ( 0 ) + n 0 (-> + n 0 <.+) + ™ c (-> - (n v ( 0) + n As m) = 0. 

Conservation of charges 



}(—) 4" 4" SN v (3~) H - Ti e (in ^3 (+) 4” 4" ^q(+) 



+ n h ) = 0. (42) 



We can get rid of these nine constraints by introducing Lagrange multipliers. The 
variables are the volumes Vs and Vl of the solid and liquid phase, respectively, 



the number densities n n 

(o) , TlAsp ? Tl 



Gai n As 



of the liquid, and the fourteen number densities, 



n Gap > n As t 



^ s (°) i ^As^^ 5 n o (0) 5 ^ 5 ’ ^y^ 3- ) ’ ^y(°) 5 ^y(°) > 



n e , n/i, of the solid. 

The minimization with respect to Vs and Vl yields the conditions for mechanical 
equilibrium, which obviously read 



QQ 

Pl=Po , o- = -PL- 



(43) 



Furthermore there result two groups of algebraic equations. The first group con- 
tains the so called laws of mass action that determine the number densities of the 
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solid phase for given (n^ a , n^ s , no, nc, nsi, n^). The laws of mass action read 



Pa S(3 ~ P (°) _ Mv^ 0) + /V 7 (0) ~ 

Ma 4 0) - ^M 0) ~ ^vi 0) + Mvf> = °> 

My(°) + My(°) + Py(°) = 0, 



(44) 

(45) 

(46) 



P 0 (+) _ M O < 0 > + Me - — (J <33> = 0, M 0 (-) “ Mo<°) - Me + — cr <33> = 0, 

^0 ^0 PS ^0 ^0 PS 



p (3-) - p v ( 0 ) - 3p e + 3— cr <33> = 0, 

^0 ^ PS 

Ma 4 +) - ^4 0) + Me - ^ <33> = 0, 

i . _j_ . . _ oMlrr< 33 > — n 

A^e H - fJ^h £ & — U. 

Ps 



(47) 

(48) 

(49) 



The equations of the second group are jump conditions at the interface that relate 
the solid densities to the number densities of the liquid. The jump conditions read 



PGa a ~ 

1 



MGa _<33> L __ n 

- Pv a - PGa - V, 



Ps 

3 (Ma4°) + M^ S 0 + m As w) - 



— » <33> - = 0. 

Ps 



(50) 



11. Interfacial conditions at curved interfaces 



In this section we generalize the interfacial jump condition to the case where the 
liquid and the solid phase are separated by a curved interface. The following re- 
sults can be obtained for spherical droplets by minimizing (37) or for nonspherical 
droplets by methods that are introduced in [3]. The mechanical equilibrium con- 
dition (43) now reads 

-(<T ik v i v k +p L )=2 <tK m , (51) 

where a denotes the surface tension, and Km is the mean curvature, with Km = 
— 1/r for the spherical liquid droplet discussed in Section 3. The jump conditions 
regarding the chemical potential at a curved interface read 



PGa a - 



Mca 

PS 



-a 



<ik> 



I/V - nv a - n Ga = 0, 



;(m A s (<» +vas0 + m As (o>) 



m As 



,<ik> i k 



v l v K - vl s = 0 . 



PS 



(52) 
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12. On the influence of misfit strain on the phase diagram 

The equilibrium conditions that were derived in the last section can now be used 
to calculate the liquidus line and the homogeneity region of the phase diagram. In 
particular, we will determine the influence of mechanical effects on these parts of 
the phase diagram. To this end, we need to know values for the standard chemical 
potentials, the interaction coefficients and the Redlich/Kister coefficients. Fur- 
thermore we need the elastic constants and thermal expansion coefficients. The 
parameters that are needed for the liquidus line are taken from [12]. The para- 
meters that are needed for the determination of chemical equilibrium of the solid 
phase are read off from different sources: the data for (49) are from [14]. The data 
for (44), (45), (46) and (48) are from [15], but they have been slightly modified. 
Data regarding the oxygen were reported from [9]. The elastic and thermal ex- 
pansion parameters can be calculated from the data given in [1]. The temperature 
dependent data for liquid arsenic are read off from [8]. The misfit parameter were 
calculated from the atomic radii data, where some information regarding the va- 
cancy misfit was given by [6]. The details of the exploitation of the equilibrium 
conditions are found to be in [4]. 




The two graphs were calculated from the solutions of the algebraic system 
(40) through (50). The graph on the left shows the liquidus line of the GaAs 
phase diagram, and turns out to be almost independent of mechanical effects. 
The graph on the right represents the homogeneity region of the phase diagram, 
where the defects are homogeneously distributed over the lattice sides. The smaller 
region results if mechanical effects are ignored. If mechanical effects are taken into 
account, the homogeneity region is affected by the outer pressure and increases 
if the outer pressure is increased. The graph corresponds to an outer pressure of 
500 bar. A similar calculation for the situation with a spherical droplet within a 
solid matrix, as it is explained in Section 3, has exhibited a much more drastic 
mechanical effect that does not need such high external pressure as in the current 
case with a plane interface, see [4]. 

The concentration of the charged antisite is of most importance regarding 

the semi-insulating behaviour of GaAs. The next Figure shows the influence of 
mechanical effects on the concentration of • 
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Right from the melting point, the concentration of As^ 
with increasing outer pressure. 



increases drastically 



13. Synopsis and outlook 

The current paper sets the beginning of a series of studies on the formation and 
growth of liquid arsenic droplets in solid GaAs. In the first paper we have developed 
a thermodynamic model which, in particular, takes local mechanical stress fields 
into account. Some simple examples show their significance on the laws of mass 
action and on interfacial equilibrium conditions. 

The next step of our investigations will be the study of the diffusion problem 
including the coupling of the Becker/Doring theory to the liquid/solid interface 
conditions. 
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On the Intermediate Surface Diffusion Flow 

Joachim Escher and Kazuo Ito 



Abstract. The intermediate surface diffusion flow is a nonlocal geometric sur- 
face evolution law with two physical parameters. We survey known results of 
this flow, such as existence results, the limiting motion concerning a param- 
eter, and self-intersections of solution surfaces. Some new results on a loss of 
convexity and singularities of solutions are also announced. 



1. Introduction 

We study the following nonlocal geometric evolution equation: 

V(t) = -A r(t) (5 - ii A r(t) ) 1 H(t) on T(t) for t > 0. (1) 

Here t > 0 denotes the time variable, and T(t) is an unknown, evolving closed 
compact oriented connected hypersurface immersed in R n , with n > 2. We write 
Ap( t ) for the Laplace-Beltrami operator on T{t) with respect to the Euclidean 
metric. Since T(t) is oriented, its unit normal vector field {N(t,x)]x G T(£)} is 
well-defined. We have two possible directions for the unit normal vector but we 
fix arbitrarily one of them. Then, in the direction of iV(t), H(t) denotes the mean 
curvature of T(t) multiplied by (n — 1) and V(t) stands for the normal velocity 
of the family {T(t ) ; t > 0}. We mention that the evolution equation (1) does not 
depend on the choice of orientation. Finally, in the setting D := 1/8 > 0 and 
M := l//i > 0, D is the diffusion coefficient and M is the mobility constant. 
Following J. W. Cahn and J. E. Taylor [2, 20], we call (1) the intermediate surface 
diffusion flow. 

The evolution equation (1) has to be supplemented by an appropriate initial 
condition 

r(0) = r„, (2) 

where Tq is a given closed compact oriented connected hypersurface immersed in 
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The intermediate surface diffusion flow (1) was introduced to describe a geo- 
metric growth law for a moving interface where surface diffusion is the only trans- 
port mechanism and the reduction of total surface energy is the only driving force 
for surface motion, cf. [2, 20]. The geometric features of this flow are the signed 
volume preserving and the area decreasing properties. To see this, we define the 
signed volume Vol(t ) for T(t) by 

Vol(t) := -- [ ( 7 ( t)\N(t))da(t ), 

n Jr(t) 

and the surface area Area(t) of T(t) by 

Area(t) := / 1 dcr(t). 

Jr(t) 



Here y(£) denotes the position vector of T(t) and da(t) stands for the surface 
measure on T(t) induced by the Euclidean metric in the ambient space i? n , and 
(*|*) is the usual inner product in R n . If T(t) is embedded and the direction of N(t) 
points inward, then the divergence theorem ensures that Vol{t) coincides with 
the usual volume of the domain enclosed by T(t). It is known that the following 
variational identities of V ol(t) and Area(t) for general moving surfaces T(t) hold 
true: 



Then one has 



ivol(t) = [ V(t)da(t), 

at JT(t) 

Area(t ) = — f H(t)V(t)da(t). 






—Area(t) = 



- [ A r(t) (5 - n A r( i)) 1 H{t)da{t) = 0, 

Jr(t) 

[ H(t)A m (<5 - a*A rw ) - 1 H(t)da(t) 

J m 

- [ V r( t) (s - mA m y 1/2 H{t) 2 do(t) < 0, 

Jr(t) 



where Vr (t) stands for the gradient on T(t) induced by the Euclidean metric in 
the ambient space R n . Hence one sees that the flow (1) is indeed signed volume 
preserving and area decreasing. 

J. W. Cahn and J. E. Taylor [2, 20] expected (1) to play an interesting role 
as a link between two known signed volume preserving and area decreasing flows. 
One is the volume preserving mean curvature flow (also called the averaged mean 
curvature flow ) 



V(t) = n~ l ( H(t ) - H(t)) on T(*) for t > 0, 



( 3 ) 
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where H(t) is the spatial average of the mean curvature H(t), i.e., 

H{t) := [ H(t)da(t) / [ 1 daft) . 

Jr(t) / Jr(t) 

The other one is the surface diffusion flow 

V(t) = —S~ 1 A r ^H(t) on T (t) for t > 0. (4) 

Similarly as above, one can check that both flows (3) and (4) are signed volume 
preserving and area decreasing. It was conjectured in [2, 20] that the limiting 
motions of (1) as 5 — > 0 and fi — > 0 are (3) and (4), respectively. This is the reason 
why (1) is called the intermediate surface diffusion flow. 



2. Existence results 

The first mathematical results for (1) was presented by C. M. Elliott and H. 
Garcke [4] . They proved for n = 2 that there exist local-in-time solutions and that 
solutions starting from initial data close to a circle exist globally and converge to a 
(possibly another) circle as t — >• oo. These results are extended by J. Escher and G. 
Simonett [10] to the higher dimensional case. Additionally, a general uniqueness 
result is obtained in [10]. We note that every compact surface of constant mean 
curvature is an equilibrium for (1), and the converse is also true by Liouville’s 
theorem. In particular, in the embedded case this leaves only the spheres, cf. [1]. 
Recently, J. Escher [5] has proved that solutions of (1) are real analytic in space 
and time, provided the initial surface is suitably diffeomorphic to an Euclidean 
sphere. In [4, 10] there is no rigorous investigation to clarify the dependency of 
(1) on the parameters 5 and /x. Recently, J. Escher, Y. Giga and K. Ito [6, 7] 
established a uniform local existence result of (1) for 5 — > 0, which is applied to 
prove that the limiting motion of (1) as 5 — > 0 is in fact the volume preserving 
mean curvature flow (3). For this we mention the precise result in Section 3. Up 
to now the question of the limit of (1) as fi — > 0 is still open. 



3. Limiting motion 

In this section we are interested in the limiting motion of (1) as 5 — > 0. So we are 
going to restrict ourselves to the case 5 G (0, 1]. Also fi is regarded to be a fixed 
constant. 

Theorem 3.1. (Limiting motion [6, 7]). Let Tq be of class C 2Jra with fixed a G 
(0, 1). Then the following hold true. 

(i) The problem (l)-(2) admits a unique local solution T 5 := {T <5 (t); t G [0, T]} 
of class C 1,2+c * on a common time interval [0 ,T\, i.e., T depends on ||ro||c 2 + Q > 
but not on 5 G (0, 1]. 
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(ii) The solution T 5 converges, as 6 ^ 0, to the unique solution of the volume 
preserving mean curvature flow (3) supplemented with (2) — with the same T o — 
in the C 1,2 + a -topology on [0,T]. 

4. Self-intersections 

In contrast to the mean curvature flow V(t) = H(t ), the flows (3) and (4) do not 
enjoy the maximum principle, cf. [11, 12, 18]. This is understood from the fact 
that the solutions of (3) and (4) can develop self-intersections even if the initial 
data are embedded. 

As an application of Theorem 3.1, we show that this is also the case for the 
flow (1), i.e., embedded hypersurfaces can be driven under (1) into self-intersections 
in finite time for S > 0. 

Theorem 4.1. (Self-intersections [6, 7]). Let 5 > 0 be fixed. There is an embedded 
closed compact oriented hypersurface Tq of class C 2 + a , with a E (0, 1), such that 
the unique solution V s of ( 1) starting from Tg develops a self -inter section at a time 
ts E (0 ,T$\, although T 5 stays in C 1,2+Q! on [O,^]. Here T$ is an existence time 
of the solution T 5 . 

We mention that T§ is eventually away from 0 also when <5 — > 0, thanks to Theorem 
3.1 (i). 

We are now going to sketch the crucial steps of the proof. Assume that (1) has 
the desired property for some <5* > 0 . Then the same is true for any 5 > 0. To 
see this, we introduce a scaling both in time and in the ambient space given by 
t — A ~ 2 t and x = A _1 x for A > 0. We then have the scaling relations V = XV, 
H = A H, and A = A 2 A. Thus (1) and (2) are rescaled to 

V(t) = -A m (a 2 <5 - _1 H(t ), i > 0; f (0) = f (5) 

Choosing A to be A 2 8 = S *, one sees that (5) has the property to develop a 
self-intersection in finite time for an embedded initial datum. Note that this self- 
intersection is invariant under the present scaling. Therefore (1) must have this 
property as well. 

Thanks to Theorem 3.1, the solutions of (1) and (3) with the same initial 
data are close to each other in the C 1,2+Q! -topology when 5* > 0 is sufficiently 
small. Since (3) can develop a self-intersection, cf. [11, 18], the same must be true 
for (1) with 6* > 0 small enough. Therefore it must be also true for (1) with any 
5 > 0 as well, as pointed out previously. 

5. Loss of convexity 

It is known that the flow (3) preserves the convexity of its solutions for all times, 
cf. [11, 15], whereas a solution of the flow (4) can lose its convexity in finite time, 
cf. [13, 16]. 
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We announce here a new result stating that the flow (1) has the latter prop- 
erty. In the following statement, a surface is said to be strictly convex if all the 
principal curvatures of the surface are positive. 

Theorem 5.1. (Loss of convexity [8]^. There is a smooth (C 6 + a with a E (0, 1) is 
in fact enough), closed, compact, oriented, strictly convex hypersurface Tq in R n 
such that the unique solution of ( 1) starting from To loses convexity in finite time, 
although it stays smooth. 

6. Singularities 

For the flow (4) in n = 2, J. Escher, U. F. Mayer and G. Simonett [9] presented 
numerical experiments for formations of singularities of solution curves with self- 
intersections. Recently, K. S. Chou [3] gave an analytical proof for the above 
mentioned experiments in a certain class of initial curves. 

In this section we announce some new results on the formation of singularities 
of solution curves of the flow (1). When we discuss this problem, we must take 
into account that two invariant quantities of the solution curves play important 
roles, namely the rotation index and the signed area. To define them suitably for 
our purpose, let us consider a general smooth connected closed curve T immersed 
in the plane. Let s and 7 be respectively the arclength parameter and the position 
vector of T. When 7 is described by s and we set T \= d^/ds, T is the unit tangent 
vector of T. Let N be the unit normal vector of T so that the frame {T, N} obeys 
the Frenet convention. Then it fulfills the relation N = R(tt/2)T, where R(tt/2) 
denotes the matrix which rotates the operand vector by the angle +7r/2. Let us 
denote by k the curvature of T; k is computed in the direction of N so that 
Frenet ’s formulae T s — kN and N s = —kT hold. Then we define the rotation 
index m of T by 




It is known that m has to be an integer. We also define the signed area A for T by 

A:=-±J^\N)ds. 

Remark 6.1. (i) When T is embedded, the above definition yields that A > 0 
when m — 1 and A < 0 when m — — 1. In fact, this is an easy consequence of the 
divergence theorem. 

(ii) Given T, the signs of m and A are only the matter of convention. In fact, 
we are free to choose arbitrary signs of m and A by adopting alternative definitions 
for the rotation index and the signed area. 

In this paper, given connected closed curves, we always adopt the sign convention 
so that m > 0. Using the notations mentioned above, we now rewrite (1) in the 
following form: 



V{t) = ~d 2 (S - nd 2 s ) 1 n{t) on V(t). 



(6) 
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In (6) dg and n(t) are used instead of A r ( t ) and H(t). Equation (6) has also to be 
supplemented with the initial condition (2). In the sequel, we will always impose 

(2) without further mentioning. 

We say that the solution to (6) develops a singularity (or blows up) in finite 
time if a smooth global-in-time solution does not exist. 

Let us denote by T ma x the maximal existence time of the solution T = 
{T(t)]t E [0 ,T max )} to (6). Thus the statement T ma x < oo means that the so- 
lution of (6) develops a singularity (or blows up) in finite time. 

The following result shows the blow-up rate of the Z/ 2 -norm of the curvature 
||ft(t)||2 of the solution of (6). 

Proposition 6.2. (Blow-up rate [8]/. Let m> 0 forT q. Assume that the signed area 
Aq for Tq is nonzero and that T ma x is finite. Then the following blow-up rate for 
the solution of (6) holds true: 

||k(*)II2 ^CiT^-t)- 1 / 4 (7) 

for t < Tmax sufficiently close to T ma x • 

Remark 6.3. The same blow-up rate as in (7) is also obtained for the flow (3) in 
[8]. K. S. Chou [3] obtained the blow-up rate for the flow (4), where the power in 
(7) is -1/8. 

Now we give some sufficient conditions for T max < oo. The first one is stated 
in the following result. 

Theorem 6.4. (A sufficient condition for singularities [8]/. Setu := (5— iid 2 s )~ l ^ 2 n. 
Assume that for any t E [0, T marc ), there is a aft) E [0,L(t)] such that u(t,a(t)) = 
0, where L(t) denotes the total length ofT(t). Then we have T ma x < oo. 

Corollary 6.5. ([8]) Assume that m — 0 for T 0 . Then T ma x < oo. 

Remark 6.6. In [8] similar sufficient conditions for singularities of solutions to the 
flows (3) and (4) are also obtained. 

For example, Corollary 6.5 tells us that if the initial data r 0 has a configuration 
of figure-eight, then the solution develops a singularity in finite time. For the flows 

(3) and (4), this kind of results have been already obtained by M. A. Grayson [14] 
and A. Polden [19], respectively. See also [3]. 

Even if the assumption in Theorem 6.4 is not fulfilled, a solution of (6) can 
develop a singularity, as shown below. For example, locally convex curves, i.e., 
curves with k > 0, do not fulfil the assumption in Theorem 6.4 since the operator 
(6 - fid*)-'/ 2 is positivity preserving, but a class of locally convex curves can still 
develop a singularity. The next theorem gives a sufficient condition for this case. 
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Theorem 6.7. (The second sufficient condition for singularities [8]^. Assume for 
To either 

(i) m > 1 and Ao < 0 
or 

(ii) m > 2 and L\ < AirmAo- 
Then T max is finite for the flow (6) . 

Remark 6.8. (a) By (ii) we can find a limagon to develop a singularity under the 
flow (6). 

(b) The condition (ii) has been first found by K. S. Chou [3] for the flow 
(4). Theorem 6.7 holds also for the flows (3) and (4), see [8], and hence it is an 
extension of the result in [3]. 

(c) Theorem 6.7 does not consider the case in which the initial curves Tq 
are embedded. But we would like to mention an interesting numerical experiment 
presented by U. F. Mayer [17], suggesting that there is an embedded initial closed 
curve To such that the solution of (3) starting from To first loses embeddedness 
and then develops a singularity in finite time. 

(d) Theorem 6.7 does not treat the case Ao = 0. But we have a result for the 
flow (4) saying that if m > 1 and Ao — 0, then T ma:r < oo, see [8]. 
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Abstract. We present a hyperbolic model for the growth of a polymer particel 
in the Zigler-Natta process. 



1. Introduction 

This paper is devoted to the problem of the growth of a spherulite in the process 
known as Ziegler-Natta polymerization. This is a rather complex problem and has 
been the object of a large literature. 

The term polymerization indicates the process in which a single monomer 
molecule joins a chain of similar molecules to form the polymer. The main pecu- 
liarity of the Ziegler-Natta polymerization is the presence of a catalyst, which is 
necessary to the polymerization. The catalyst is a metal molecule (titanium allide), 
spread on the the surface of very small particles (supported catalysis). 

A well accepted approach for describing the process in the proximity of a 
single catalyst particle consists in the so called “solid core model” for polymeriza- 
tion. Let us quote from [6]: “The solid core model for polyolefin polymerization 
is simply based on a spherical catalyst particle with a spherical shell of polymer 
growing around it. ... Growth of the polymer shell results directly from the rate 
at which polymer is produced. The kinetics depends on both the temperature at 
the surface of the catalyst and the monomer concentration at that point” . 

According to this description the newly formed polymer surrounds the cat- 
alyst, and the polymer shell has to expand to accommodate the new inner layer. 
This picture however poses a number of non trivial problems when we try to 
translate it into a coherent mathematical model. 

The very first question is how the monomer can reach the catalyst surface 
after a polymeric shell has been formed around it. It is generally assumed that 
the polymeric shell is a porous medium and the monomer fluid reaches the surface 
through a diffusion mechanism (or a Darcy type flux, depending on the pressure 
conditions.) 

A more intriguing question is “how does the polymer shell move in its ex- 
pansion?” . Even if we assume a priori a simple spherically symmetrical geometry, 
i.e. that the catalyst particle is a sphere and that the polymer shell is a growing 
sphere at any time, the equations determining the velocity field should include the 
structural assumptions about the mechanical behavior of the polymer shell itself. 
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A simple way to write the equation for the velocity field, under the assumption 
of spherical symmetry, is to assume that the expansion of the shell is “porosity 
preserving”, i.e. that the spherical shell grows rearranging the polymer chain in 
such a way that the porous matrix formed by the interlaced chain has constant 
porosity. This allows to write a simple scalar equation for the velocity field, which 
is sufficient to determine the unique non trivial (radial) component of the field. 

In such a way a coherent mathematical model can be derived. A mathematical 
analysis of it has been performed in [1]. 

However this is not the only possible assumption, even in the spherical sym- 
metric case, and it fails to give a sufficient number of equations if the symmetry 
assumption is not satisfied. 



2. The elastic model 



In order to overcome the geometrical restriction of the solid core model, we have 
to introduce continuum mechanics into the model. 

In fact we must specify how the porous matrix formed by the growing polymer 
shell reacts to the expansion. This means that we have to introduce a stress tensor 
as a new unknown quantity and a constitutive assumption relating the stress to 
the deformation. 

The simplest assumption, at least from the conceptual point of view, is that 
there is a linear dependence between the strain rate and the deformation rate 
(hypoelasticity) . 

This means that at any point in the polymer shell we assume the hypoelastic 
constitutive relation between the increments of the stress tensor (a ij) and the 
strain e^, given by 



Ddij — EijkiDe 



°kh 



Eijki = (K — -G)SijSki + 2 G5ik$ij, 



where K and G are respectively the bulk and the shear moduli, [5]. 
The strain is then expressed by 



Deij — CijDt , 



1 / dvi dvj 

2 \dxj + dxi 



(1) 

(2) 



where is the strain rate of the velocity field v. 

The increment operator D must be, in principle, expressed through the 01- 
droyd derivative. However, in practice the use of the material derivative 



d _ d d 
dt dt Vk dxk 



(summation over the index k) is sufficient. 

With this simplifications, the constitutive relation becomes 



dcr, 

dt 






Eijki&ki ■ 



( 3 ) 
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Once the moduli K and G have been determined, equation (3) tells how the 
material forming the porous matrix reacts to deformation of the matrix itself. 

3. The two phase system 

In order to give a complete description of the expansion we have to take into 
account that our system is a two phases system composed by the matrix and the 
monomer inside the pores. This implies that we have to write appropriate equations 
expressing the following facts: conservation of mass for both phases, the solid 
one(i.e. the porous matrix) and the fluid one (i.e. the monomer inside the pores); 
conservation of momentum for both phases; constitutive relations, including (3); 
appropriate boundary conditions. 

The basic form of the equations for a two-phases porous-fluid system can be 
found in the book of Nikolaevskij, [5]. 

Conservation of mass for the solid gives 

A 

— (p s (l - n)) +div \p s {l -n)V s ] = 0, (4) 

and conservation of momentum for the solid phase gives 

(1 -n)p s ^V S +V S -VV S = div gf — (1 — n) Vp — R , (5) 

where p s is the density of the solid, V s is the solid velocity, n is the porosity, p 
is the fluid pore pressure and of denotes the Terzaghi effective stress tensor, [5], 
which can be expressed in terms of the pressure and the solid matrix stress tensor 
g by gj = (1 — n) {a_ + pid) . 

The vector R is the fluid-solid interaction force given by R = r (V s — V/) , 
where V/ is the fluid velocity (with respect to a fixed frame of reference) , and r 
is the resistence coefficient,. . . [5, (4.36), p. 192],. 

Similar equations can be written for the fluid phase. Here we assume that the 
pressure conditions are such that the monomer can be treated as a liquid, see [3]. 
For slow motion we can assume that the fluid motion inside the (moving) porous 
medium is described by Darcy’s law, see Bear [2], 

q r = -fcVp, (6) 

where k is the hydraulic conductivity, q r = (q — nV s ) is the specific discharge 
with respect to the moving solid particles, and q = nV f is the specific discharge 
with respect to the fixed coordinates (here we can disregard the gravity in Darcy’s 
law), [2]. Assuming the fluid to be incompressible, the fluid mass conservation 
becomes 

dirt 

Pf-fo+ diy \Pf (qr+ nVs)] =0 (7) 

Another simplification can be made concerning the solid matrix, see [2]. It 
consists in assuming that, although the medium as a whole undergoes deformation 
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(reflected in the change of porosity), the solid grains themselves are rigid, which 
implies that p s = const. 

Under this assumption, equation (4) reduces to 

^(l-«) + div[(l-n)V a ]=0, (8) 

which, together with the assumption of incompressible fluid, allows to cancel the 
porosity term in the fluid mass conservation equation and to reduce it to 

div [q r + V s ] = 0 (9) 



4. Boundary conditions 

The boundary conditions are one of the most delicate points in the modelling of 
the growth of the polymer shell. 

According to [6] : “It is thought that the growing polymer is directly attached 
to an active titanium atom on the catalyst surface, and propagation occurs by 
insertion of monomer between the metal atom and the growing chain.” 

Suppose than that at time t the polymer shell (the solid matrix) occupies 
the region Q(t)\Qo surrounding the catalyst particle (solid core) flo (for simplicity 
we assume that both Cl{t) and flo are simply connected regions). The growth 
mechanism can be modelled assuming that a reaction takes place at the catalyst 
surface <9flo transforming the monomers in the fluid phase into parts of the chains 
forming the polymer shell. The conservation of mass in this reaction will be one 
of the boundary conditions. 

In turn, the growth of the chains pushes the shell and this can be accounted 
for prescribing that the normal velocity of the solid matrix at points on flo is 
proportional to the quantity of monomer entering the reaction. 

So, if V s is the solid velocity at the surface of the catalyst, and v denotes 
the outward normal to dflo, the quantity p s { 1 — n)V s • y_ is the quantity of newly 
created polymer for unit time, and PfriV s • v_ is the quantity of monomer which 
has been trapped into the newly formed porous matrix. The sum of these two 
terms must equal the quantity of monomer which has reached the inner boundary, 
namely —pf q r • z/, so that we can write the first boundary condition 

p s ( 1 - n)V a • v + pfTiV s • \L = -p/Qr * K, (10) 

or, using (6), 

n + — (1 — n) 

Pf 

We assume that the speed of reaction is proportional to the pressure of the 
monomer fluid, in agreement with the microscopical interpretation of the pressure. 
Thus we can write the second boundary condition as 

ps( 1 - ri)V s -V = \p 



V s — kVp ) • v = 0. 



( 11 ) 



(12) 
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Finally one has to decide how to fix the value of the porosity of the newly 
formed polymer matrix. The simplest thing to do is to assume that this porosity 
is constant (i.e. depends only on the characteristics of the catalyst surface) 



n = no 



(13) 



The first boundary condition on the external boundary dQ(t) is simply de- 
duced from the fact that dQ(t) is a material surface whose point are moving with 
speed V s . Then, if dfl(t) is the set of the points satisfying an equation of the form 
/(x,£) = 0, the boundary condition is 



V,-i/ = - 



4L 

dt 

|V/| 



(14) 



Since the particle is surrounded by the monomer fluid, it is reasonable to 
assume that at dfl(t) the pore pressure of the fluid equals the exterior pressure 



P = Po- 



(15) 



Finally, since there is no extra load on the external surface, but for the fluid 
pressure, the effective stress vanishes on the external boundary, 

(g + pM)^ =0. (16) 



5. Equation for the one dimensional case 



To investigate the well-posedness of the complete set of equations is a very hard 
task because of the lack of a good theory for hyperbolic systems in R 3 . In order to 
illustrate the kind of mathematics involved in the problem, we devote this section 
to the one-dimensional problem 



(v Kp x )x 


= 0, 


(17) 


(ft + (vip) x 


= 0, 


(18) 


V+P , 1 T-> 

(f x + R 


= 0, 


(19) 


pp pp 




<rt + vcr x ~ ®v x 


= 0, 


(20) 



where v is the velocity, ip = (1 — n) is the local solid fraction, p is the solid density 
and a is the solid stress. The reaction term R is given by 



rkp x 






-p 



(21) 



where r is the resistance constant, see [5]. 

Equations (17)-(20) must be solved in a domain 



D(6) = {0 < x < x(t), 0 < t < £} , 



(22) 



whose right boundary x = x(t) is a free boundary defined by the differential 
equation 



-(t)=v(t,x(t)), s(0)=0. 



(23) 
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Boundary conditions for (17)-(20) are given by 



p(t,x(t)) 


= Po , 


(24) 


v(t, 0) - Kp x (t,0) 


= o, 


(25) 


o{t,x{t)) + p{t,x{t)) 


= o, 


(26) 




= Po, 


(27) 


v(i,0) 


= pp(t, 0) , 


(28) 



where po , </?o and p are given positive constants 1 

For any given pressure p(x,t ), we can regard the last three equations of the 
system (17)-(20) as a (possibly) hyperbolic system, which can be written in the 
vector form 



Ut + B(p,u)u x = F(p,p x , u), 



(29) 



with 



V 

u = | v 

a 



B = v 

1 pv 



ip 0 



, F = 



0 —a v 
The eigenvalues of B are given by 



0 

-±R 

pep 

0 



(30) 



a — (a + p) l<x-(cr + p) 

v-\ , v, v+ 1 



(31) 



which implies that the system (29) is hyperbolic for any given pressure p and stress 
a satisfying 

a + p < a. (32) 

It is worth noticing that condition (32) requires the effective stress to be bounded 
by the elastic modulus of the porous matrix. Since v is an eigenvalue of B and in 
view of the definition of D(5), it results that the hyperbolic problem is a charac- 
teristic free boundary problem , i.e. the free boundary is one of the characteristics 
of the system. 

We observe that, whereas equations (17)— (19), and in general (5)— (7), can be 
written in conservative form, this is not true for the stress-strain relation (19). So 
we cannot rely on most of the results in the theory of hyperbolic systems, which 
requires conservative systems. 

We can consider (26)-(28) as the boundary conditions for the hyperbolic 
system, and regard (24) and (25) as the two boundary conditions for the elliptic 
equation for the pressure (with given velocity field v(t, x)). This can be easily done, 
formally, by expressing the pressure and its spatial derivative as functions of the 



x We used in (25) the simplifying assumption pf = p s ; we shall explain how to deal with the 
general case at the end of the existence proof. 
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solution of the hyperbolic problem,. 

v(t,x) - Kp x (t,x ) = 0 , 

l r x (t) 

P{t,x) =p 0 - — I 



(33) 

(34) 



Notice that in (34) the upper limit of the integral is the free boundary itself. 



6. Local existence and Uniqueness 



In order to perform the mathematical analysis of the hyperbolic system (29) we 
write it in characteristic form by multiplying equations by the left eigenvectors of 
the matrix B and summing them up. The resulting system is 



o + p 



(7 +P 



, y/p{a- (a + p)),l 






(a +p)),l 



u t + 1/ 






~(g+p) 



U. 






a — (a+p ) 

— 2 — R, 

cn ' 



■ [u t + = 0 , 



Ut+lv+V ^ 5 ^ 1 lu. 






+p) 



R. 



(35) 

(36) 



(37) 



In (35)-(37), we express the source term as a function of the solution itself, 
namely 

R = (38) 

l-<p 



and we express p = p[u] as a function of the solution u given by (34). The system 
must again be solved in the domain D(S) (for, at least, a suitably small S) with 
the boundary conditions (24)-(28). 

According to the classification in [4], problem (35)-(37), together with bound- 
ary conditions (24)-(28), is both a free boundary problem with characteristic boun- 
dary and a problem in functional form. 

The strategy proposed in [4] for solving the free boundary problem consists 
in transforming it into a problem in functional form in the fixed domain 



R(5) = {(x, t) | 0 < x < £, 0 < t < £}, 



(39) 



by means of the domain transformation 



t — t , 



*(*) 



x . 



(40) 



We can use the results in [4] about the free boundary problem once we have checked 
that the extra functional dependence introduced by the term p[ u] in the coefficients 
and the eigenvalues of the hyperbolic system satisfies the hypotheses . . . required 
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in [4, 2.6]. Further, we have to control that the functional dependence in the last 
boundary condition does not affect the characterizing matrix , so that definitions 
[4, 2, (6.29)] and [4, 3, (1.11)] coincide. 

Since the coefficients are regular functions of p (as long as the hyperbolicity 
condition (32) is satisfied), the regularity requirements on coefficients and eigen- 
values are satisfied if we prove that the sup norm of p[ u] is 

Lipschitz continuous with respect to the sup norm of the solution u. This 
must be done in terms of the transformed variables: let us denote by 

t — t, x = X(t, x) = °^^-x , (41) 

the inverse transformation of (40), and by v(t,x) := v(t,X(t,x )) , and analogously 
for Ip, a. The pressure p(t,x) = p(t,X(t,x)) can be computed directly from the 
transformed variables 



P(t,x) =p 0 



^rj(u(£,0d£, with x{t) = J^v(t,t)&t . (42) 



Now, for any given Ui and u 2 , let p x and p 2 be the corresponding pressures 
defined by (42) (with Vi instead of v). We have 



h -P2W = SU P^ 



X\ (t) (_ Vi(t,£)d£- x 2 (t) (_ v 2 (t,£)d£, 

J X J X 

[_ (vi(t,€) -v 2 (t,0)d£ 

J X 



Xl(t) 

Kt 



(43) 



+ sup JH 



f Mt’Odt [ (”i (t,t) -V2 (r,r}) 
Jx Jo 



dr 



and the Lipschitz continuity follows immediately, because both and 



SU PI 



J^ v i(£>0d£ are uniformly bounded. 



We now write the boundary conditions for the characteristic variables, namely 



u\ = y/pav + a — -po + yfpav := G\ (44) 

=V+^^ = Vo + ^:=G 2 (45) 

^3 = —y/jmv + a = —y/pa pp[u](t, 0 ) + cr := G 3 . (46) 

To prove that our problem has a solution, it remains to check the resolubility 
conditions on the characterizing matrix of the hyperbolic problem. 

To this aim, let E(<$, f2i) be the space of the C 1 vector valued functions 
v = (v\,V 2 )Vz) defined on the angular region R(S) and satisfying the inequality 

max \\qk\\ < fii, (47) 

k 
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where (for simplicity, we omit the superscript in the following computation) 



Qi — 



dvj 

~dt’ 



dvi dvi 
Q3+i = ~dt + ~dx J 



1,2,3,. 



(48) 



and || || denotes the C 1 norm in R(5). 

Let v G E(5, f2i), and 

G[(t) = ^Gi(f;v(t,0)), & 2 {t) = ^G 2 (t;v(t,t)), G 3 {t) = ^G 3 (t;v(t,t)), (49) 

where the G{ are defined by (44)-(46). 

By definition, the characterizing matrix of the problem is a matrix (Oik) 
satisfying the inequality 



II^WII < Y]( e ik + e(6, fii)) max(||^||, \\q 3+k \\) + R 2 , 



k=l 



(50) 



where i ?2 is a constant independent from 5 and ffy 



e(6, ffy) — > 0 as S — 0 for any ffy (51) 

and q * = (® k qk, where (£? fc ) is the matrix of the left eigenvectors of the matrix B 
defined by (30), computed for x = 0 and t = 0. 

In order to determine the matrix (Oik), we need to give upper bounds for the 
G[. The first two derivatives G[ and G' 2 are trivially bounded choosing the matrix 
0 — H with 

H = (||| (0,0, u(0,0))) (C jfc (0, 0, u(0,0))) , (52) 

where (£ J ' fc (0,0,u(0,0))) is the inverse matrix of the matrix of (£9 fc ). 

It remains to show that G 3 can be bounded by the same matrix H, i.e. 



G' s (t) = -G 3 (v(t,x(t))) 



da n — 

8t K ^ 



d x(t) rt 
dt t j q 



v 2 (t,£)d£ + v 2 (t,t)^ + ^ ^v 2 (t,£)d£ 

< max{g 3 ,93 +3 } (1 +e((5,fii)) + R 2 . (53) 



In fact, the estimates 



v 2 (t,t) 



x(t) 

t 



< vl + (2v 0 q 2+ 3 + ql +3 t) t, 



x(t) [ f d 
t j o 

are trivial. 

On the other hand, 

ML 01 < + t max 

T<t 






< (vo + < 72 + 3 ^) Q2t 






+ 1 max 
x<£ 



dr ( 



( 54 ) 

(55) 

<v 0 + 2tfti, (56) 
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and 

< 2y + 2q 2+3 , (57) 

whence there follows 

d x( t j ft 

J v 2(t^) d ^ < ( 2v o + 2g 2 +3*)( v o + 2tOi). (58) 

Substituting qi by q* , we finally get that the matrix H given by (52), namely 




x'(t)t — x(t) 
£2 



is the characterizing matrix. 

We have now to compute the norm defined according to [4] [2,(4. 12)], namely 



I# Imin = inf \jH-y 

7=diag(7j),7i7=0 



~1| 



(60) 



where, denoting H = ( 6 ik ), \H\ = max/ ^ =1 \0 ik \ . 

It is a matter of straightforward computation to show that H just fails to 
satisfy the sufficient condition for the C 1 resolubility of the problem, since the 
above norm is exactly equal to 1 (the sufficient condition being |#j m in < 1.) 

Then we have to relay on the solvability results in the higher regularity classes 
C N + 1 given by the conditions in Chap. 7 of [4]. 

We want to show that these conditions are satisfied for N = 1. 

In order to do this, we rewrite the boundary conditions as 

ul(t,x(t)) = ~u* z -Po = Ti, (61) 

u 2 (t,0) = <p 0 + — « - Vpa/J-p(t, 0)) = r 2 , (62) 

a 

ul(t, 0) =u\- 2y/pafip(t,Q) = r 3 . (63) 

(64) 



Then the C 2 resolubility condition consists in showing that the matrix 



H x =Hq = 



( o 

ar 2 

du\ S1 

I r 

\dul 



where 






A? 






ar, , 

du* 

0 



A?-A° 



ft = 



A° — *o 

A^ 



ar, \ 

dul ^ \ 

0 

0 ) 



0, <r 3 



A^-A° 

A^ 






(65) 



( 66 ) 



satisfies |i7i| m i n < 1. 
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A straightforward computation shows that |#i| m i n = \ whence we obtain 
the following result: 



Theorem 6.1. There exists a sufficiently small 6 such that problem (35)— (37), (26)- 
(28) and (34) admits a unique C 2 solution in D(5). 

The general case p s ^ pf can be handled by substituting (25) with the more 
general boundary condition 



v(t, 0) - Kp x (t, 0) 



1 — — ) <Pov{t, 0 ), 

Pf J 



(67) 



which gives 



p(t,x) 



p{t, 0 ) 



1 - 



1 f x{t) 0 

Po-ftJ v{£,t)dt+ 0 ), 

l r x (t) 

Po- K J o dt > 



( 68 ) 

(69) 



where 0 = (l - ^-) ip 0 . 

A straightforward computation shows that the functional p defined in (68) 
is still a Lipschitz continuous functional of the solution, so that Theorem (6.1) 
applies in this case as well. 
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Transmission-Stefan Problems Arising in 
Czochralski Process of Crystal Growth 

Takesi Fukao, Nobuyuki Kenmochi, and Irena Pawlow 



Abstract. We consider a transmission problem of two parabolic partial differ- 
ential equations as a mathematical model of the Czochralski crystal growth 
process. Originally this process is a three-phase system and involves the three 
interfaces of solid- liquid, liquid- gas, and solid- gas, which should be determined 
as a part of solution from the mathematical point of view. Nevertheless, as- 
suming that the solid-gas and liquid-gas interfaces are prescribed, we propose 
in this paper a transmission problem composed of a linear heat equation in 
the gas region and a nonlinear degenerate parabolic equation in the material 
region as a simplified mathematical model of the Czochralski process. 



1. Introduction 

The Czochralski pulling method is widely used for the production of a column of 
single silicon crystal from the melt. Mathematically, this is a free boundary problem 
with three phases, solid, liquid, and gas. These locations should be determined by 
some balance laws on their interfaces as a part of the solution of the problem. This 
process has many interesting physical aspects; for instance, there is neither a phase 
transition between gas and liquid nor between gas and solid, and the solid-liquid 
phase transition occurs on its interface, which means that the balance equation on 
the solid-liquid interface is of the Stefan type and those on the gas-liquid or gas- 
solid interface are of the usual transmission type. Many authors have studied this 
process from various mathematical and mechanical points of view (cf. Crowley [2], 
Francu [5], Kenmochi and Pawlow [11], Pawlow [12], and so on). In connection with 
this process, a transmission problem between two kinds of degenerate parabolic 
equations has been recently proposed by Fukao, Kenmochi, and Pawlow [7], in 
which the main idea is to consider a transmission condition on the interface of the 
material and gas regions. This problem has been solved by the abstract theory 
of evolution equations governed by time-dependent subdifferential operators. But 

2000 Mathematics Subject Classification. Primary 35K65; Secondary 80A22. 
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this is incomplete as a model of the crystal growth process from some physical 
point of view. For example [7; Remark 2.1], and so on. Therefore, in this paper 
we propose a new model for the Czochralski pulling process, which is physically 
more acceptable; in our new model, the transmission condition on the gas-material 
interface requires the continuity of the temperature and the balance of the heat 
flux, and we still suppose, just as in [6, 7, 8], that the movement of the gas-material 
interface is prescribed smoothly in time. The last assumption is based on the fact 
that the crystal radius (hence the shape of the crystal) can be technologically 
controlled by a suitable choice of the pulling velocity and of the thermal situation 
in the gas region. 

We use the following notation (see Fig.l). For 0 < T < oo and t E [0,T], we 
let $1 C R 3 be a bounded domain with smooth boundary T := dft and 

n:=n m (t)un g (t)ur gm (t). 



r g(t) 




£lm(t) := &e(t) U Q s (t) U S(t) is the 
material region, in which the liquid region 
Qe(t) is separated from the solid region Q s (t) 
by the solid- liquid interface S(t). 

Q g (t) is the gas region. 

r gm(t) := T g e(t) U T gs (t) is the interface 
between Q m (t) and which consists 

of the gas-liquid interface T g e(t) and the 
gas-solid interface T gs (t). 

T := T e (t) U T g (t), where Te(t) (resp. 
T g (t)) is the subset of T on which the liq- 
uid (resp. gas) touches T; hence r m (£) := 
dn m (t) = r £ {t) U T ge (t) U T gs (t). 



Note here that our geometrical assumptions ensure that in our model the solid 
region is detached from the crucible boundary T. Moreover we use the following 
notation: 

i/ = v(x) : 3-dimensional unit vector normal to T at x E T, 

:= : 3-dimensional unit vector normal to T gm (t), pointing to £l g (t), 

n + := n + (t,x) : 3-dimensional unit vector normal to S(t), pointing to 
v~ v~(t, x) ~ -i^ + , n“ := n _ (t, x) = — n + , 

Q := (0 ,T)xft, E:= (0,T)xT, 

Qi '■= (J {t}xni(t), i = m,t,s,g, 

te(o,T) 

:= [J {tjxTiit), i = m,£,g,gm,ge,gs, S := |J {t} x S{t). 

t€(0,T) t€(0,T) 
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Next, we denote by v := us m (£,x) and vs := vs(t,x) the normal speed of 
r m (t) at (t,x) G and of S(t) at (t,x) G 5, respectively. Then 4-dimensional 
(with respect to the (£,x)-space) unit normal vectors and n + , outward from 
Q m to Q g and Qe to Q s are given by 



— n+ = 



-(-v s ,n' 



(KJ 2 + 1)* (H 2 + l)* 

In this paper we assume that the material domain Cl m (t) depends smoothly 
on time t in the following sense: 

(Al) There is a transformation y = y(t,x) := (yi(t,x),y2(t,x),ys(t,x)) which 
is a function of C 2 -class from Q into R 3 , satisfying that y = y(t,x) is a 
C 2 -diffeomorphism from Q onto itself for each t G [0,T] and 

y (t,Qi(t)) = fii(O) for all t G [0,T] and i = m,g, y(0, •) = I (identity) in ft. 

Also assume that the convective fields v m and v g which occur by the defor- 
mation of the domains f l m (t) and £l g (t) are prescribed, and they satisfy (A2): 

(A2) Vi G C 1 (Qi) 3 and the following properties are satisfied. 

divv*(£, •) = 0 in f li(t) for all t G (0,T), i — m^g, (1.2) 

where div means the divergence with respect to the space variable x, 

V m • = V„ • u + = v Sm on (1.3) 



v m • v = v g ■ v — := 0 on E. (1.4) 

For the convective vector fields v* on Qi, i — m,g, the assumptions (1.2)- 
(1.4) are quite natural from the physical point of view. Under these conditions, 
we consider the following system, where Oi := 6i(t,x) is the temperature on Qi , 
i = £,s,g, 

Ci(0i,t + v* • V6i) - kiA0i = f mQi, i = £,s,g, (1.5) 

6 £ = 6 s = 0, = L ( v m * n + - v s ) on 5, (1.6) 

where and hereafter := ( d6i/dt ), V#i := (d9i/dx i, dOi/dx 2 , d6i/dx 3), A 0i := 
Sj=i (<9 2 #i/<9x 2 ), d0i/dn + := V0i • n + for i — £,s,g. In (1.5) and (1.6), we take 
Vi = Vm for i = s,£. Moreover, the positive constants Ci,ki ( i = £, s,p), and L 
are positive constants which are the specific heats, heat conductivities and latent 
heats in Qi. A heat source / is given on Q. Note that (1.6) is the Stefan conditions 
taking into account the convection v m , where the phase change temperature is 
supposed to be 0 for simplicity. Moreover, on the gas-material boundary H grn we 
require the following transmission condition: 

@ gi ^9 ()l/~ — ^ ^ — £) S. 



(1.7) 
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The boundary condition on S is given by 

d6 

ki-^+ n 0 6i=p onEi,i-£,g, (1.8) 

where no is a positive constant, p is the boundary datum prescribed on E, and 
the initial conditions are prescribed as follows: 

6i(0, •) = 0 iO on fti(O), i = £,s,g, S(0) = S 0 , (1-9) 

where So is the initial location of the solid-liquid interface and 0io, i = £, s,g, are 
the initial temperature on f^(0) which satisfy the compatibility condition 

0io > 0, 9so < 0 and 0 £o = 9 s o = 0 on So- (1.10) 

As a remark, when 9{ = 6 g is replaced by kiOi = k g 9 gi for i = £,s, in (1.7), 
the problem was treated in Fukao, Kenmochi and Pawlow [7] and an existence- 
uniqueness result was established in the framework of the abstract theory of evo- 
lution equations of time-dependent subdifferentials. However, the same approach 
cannot be applied to the system (1.5)— (1.9) proposed above, and one of the diffi- 
culties in its mathematical treatment comes from the transmission condition (1.7). 
The uniqueness argument, based on a dual method [7; Section 5], does not seem 
to work in this context, either. 



2. Weak Formulation and Existence Result 



In this section we give a weak formulation of system (1.5)— (1.9) and state an 
existence result for it. We begin with defining the enthalpy function u on Q and 
the heat conductivity function k(r) on R by 



u 



u{t,x) < 



= c s 9 
G [0, L] 

— Cg9 -f- L 
= c g 6 



if x e 9 < 0, 

if x E Q m (£), 0 = 0, 
if x E £1 m (t ), 6 > 0, 
if x E f^(£), 



k(r) 



k s if r < 0, 

ki if r > 0, 



( 2 . 1 ) 



( 2 . 2 ) 



where for each t E [0,T] we put 0(t,x) := 9i(t 1 x) if x E £li(t) for i = £,s,g. By 
using these functions u, 9 and k our problem (1.5)— (1.9) is reformulated in the 
following weak sense (cf. [7]). 



- u(rf t + v • Vrj)dxdt + V0 • Vgdxdt 

+no / 9rjdTdt = / frjdxdt+ / prjdTdt-\- / uorj{0)dx for all rj E W, 
Je Jq Je Jn 



(2.3) 



where W := {rj G t](T, •) = 0 on Cl}, and 

v(t) :=v m (*)xn m( i)+v g (*)Xfj^, u 0 := c s e o xn s (o) + (ceQo + L)xn e (o)+c g 0oXn^, 
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Xni{t), X ^ ~ (t y are the characteristic functions of Cli(t) for i — £,s,m and of 
for each t € [0,T]. 

In fact, the variational identity (2.3) is derived in the following way. Assume 
that 6i , i = satisfy system (1.5)— (1.9) and 0£ > 0 on 6 S < 0 on Q s , 

and let rj be any test function in W. First, with the help of the Green-Stokes’ 
formula and the relations = (\v^ m \ 2 + 1 ) l / 2 dTi(t)dt for each i, we see from 
conditions (1.2)-(1.4) and (1.6) with expression (1.1) of that 



/ • V6i)rjdxdt 

d Qi 

= - ce0e(r) t + \e-'\7r])dxdt+ / ct6tq(v + ) t dZ g i + / Ci9 eo r](0){-l)dx 

JQz Jn t (0) 

+ [ [ ceO(T](v( ■ i> + )dT g e(t)dt 

do JYgzit) 

= - c e 6e(r]t + ve ■ Vyfidxdt - / Q0«jf?(O)cfa:, 

J Qp J 



where (P+Y denotes the t-component of in order to get the last equality we 
used 



(O+YdTigi = -v^ m dr g £{t)dt = (v £ • i/+)dTg£(t)dt on E g £. 

We have a similar result for i = s, g, so that for i = £,s,g, 

/ Ci(Oi it +Vi-'VO i )ridxdt = - CiQi(r] t +Vi-X7ri)dxdt- Ci6 i0 r](0)dx. (2.4) 
JQi J Qi 0 ) 

Therefore, summing up (2.4) for i = £, s,g, and using the enthalpy u , we have 



I Ci(6i :t + 'Vi • V9i)rjdxdt = — u(rj t + v • Vrj)dxdt 
— „JQi ’ JQ 

/ u 0 r](0)dx + / L(rj t + v m • S7rj)dxdt + / Lrj(0)d: 

J d Qi ddp( 0) 



i=t,s,g 



(2.5) 



Next, by (1.6)— (1.8) we have 

— / keAOtrjdxdt — / k s A6 s r]dxdt — / k g AO g rjdxdt 

d Qp d Q s d Q g 



= f Q {W)Xn m (t) + k g x^}v0-Vr)dxdt + 

— f L(v • n + — vs)rjdS(t)dt. 
ds 



J (no# — p)rjdT(t)dt 
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Thus we have 



~ Y J kiAOir)dxdt = j {k(0)xn m (t)+k g x^fi}v6-'Vr)dxdt 

i=e ’ s ’ 9 Qi Q 9 ( 2 . 6 ) 

+ / (no# — p)rjdT(t)dt — / L(y • n + — vs)rjdS(t)dt. 
js Js 

Here add (2.5) and (2.6). Then it follows from (1.5) that 

- J u(rj t + v- Vrj)dxdt + J jfc(#)xQ m (t) + k gXnjjj} V# • Vpdxdt + n 0 J OrjdTdt 

+ / L(rj t + v m • Vrj)dxdt + / Lrj(0)dx - / L(v m • n + - vs)rjdS(t)dt 
J Qg o) Js 

= / fr)dxdt+ / p77drdt+ / uor)(0)dx. 

Jq je jq 

Here, with the help of the Green-Stokes’ formula, we see from conditions (1.2)- 
(1.4), and the expression (1.1) of n + and the relation dS = (l^sl 2 + 1 ) 1 / 2 dS'(t)dt 
that 

/ L(rj t + v m • Vrj)dxdt + / Lrj(0)dx = / L(v • n + — vs)rjdS(t)dt. 

Jqh o) Js 

Therefore the variational identity (2.3) is obtained. 

Here we define a weak solution of our problem (1.5)— (1.9) . For simplicity, we 
denote by V the Sobolev space i/ 1 (fi) with the norm | • |y given by 



\u\ v := 



! \Wu\ 2 dx + no / |n| 2 dr > for all u G V, 
o Jr J 



which is equivalent to the standard norm of H 1 (Q). The dual space of V is denoted 
by V*, with duality pairing denoted by (•, -)y*y. Then we have the usual dense 
and compact imbeddings: 

V<^l 2 (Q)^V*. 

Next, we define a multivalued function A(t, x, r) on [0, T] x fl x R into R by 

( c s r if x G fi m (i), r < 0, 

M if x G r = 0, 



A(t,x,r) = 



C£r + L if x G fi m (t), r > 0, 



^ c g r if x G £l g (t). 

By using this function A and the temperature #, the enthalpy u can be given as a 
measurable selection of A(t, x,r), namely 

u(t,x) G A(t,x,0(t,x)) =: v4(#)(£,x); 

note here that 6 is a single valued function of n, since A has a single valued inverse. 
Now we are in a position to give the weak formulation of our problem (1.5)- 

(1.9). 
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Definition 2.1. A pair [0,u] of functions 0 and u : [0,T] — > L 2 (Q) is called a 
weak solution of (1.5)— (1.9) if {0,u} satisfy the following conditions: 

(Dl) 0 G L°°(0, T; L 2 (Q)) fl L 2 ( 0, T; V), u G W 1 ’ 2 ^, T; P*) n L°°(0, T; L 2 (ft)). 
(D2) u G A(0) and 0 and u satisfy the variational identity (2.3). 

Our main result is stated as follows. 



Theorem 2.2. Assume that (Al) and (A2) hold . Let f G L 2 (Q), p G C ,1 (E), 0o eV 
and uq G L 2 (Cl). Then there exist at least one weak solution {0,u} of (1.5)— (1.9) . 



Our assumption (Al) ensures that by the coordinate transformation y = 
y (t,x) the time-dependent domains Clift), i = m,g, are mapped onto time- 
independent ones. In Section 3 we consider the system (1.5)— (1.9) written in 
the new coordinate (t,y). In Section 4, we show the existence of a weak solution 
of the transformed system by the so-called time discretization method. Note here 
that for each u G W 1 ’ 2 ^, T; P*) n L 2 (0,T;P) with u(0) = u 0 in L 2 (Cl) and for 
r/GP 



/ (u'(t),r)(t)) v *ydt = - / (u(t),rj\t)) L 2 (n) dt- (u 0 ,r)( 0 )) L 2 (n) , 

Jo Jo 



where (-,-)l 2 (^) denotes the usual inner product in L 2 {Cl) and u! — ( du/dt ). By 
this relation we see that the weak variational form (2.3) is equivalent to 

[ (u f (t),T}(t))v*,vdt+ [ (K(t,0(t)),ri(t)) v *ydt + [ (G(t,u(t)),ri(t)) v *ydt 
Jo Jo Jo 



T 

(f*(t)Mt))v,vdt for all T] £ L 2 (0, T; V), (2.8) 

u( 0) = uo in L 2 (0), (2.9) 

where for each t G [0,T], K(t, •) : P — >• P * is a nonlinear second order differential 
operator and G(t , •) : L 2 (Cl) — > P* is a linear operator given for each z, z\ G P 
and Z 2 G L 2 {Cl) by the following formulae: 

(K(t,zi),z) v .y := j |fc(zi)xn m ( t ) + Vzi • Vzdx + n 0 j z 1 zdT, 




(■ G{t,z 2 ),z) v * tV := - / z 2 (v{t) -Vz)dx, 

J n 

and /* £ L 2 (Q,T- V*) is defined by 

{ f*(t),z) v *y := (f(t),z) L 2 { o) + J^p(t)zdT for all z £ V. 

It is clear that (2.8) can be written in the form 

u'(t) + K(t,0(t)) + G(t,u(t )) = f*(t) in P * for a.e. t G [0,T]. 



( 2 . 10 ) 
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3. Transformation of (1.5)-(1.9) 

In the mathematical treatment of our problem (1.5)— (1.9), one of the difficulties 
comes from the fact that the domain depends on time. In order to overcome this 
difficulty, we write the equations (1.5)— (1.8) in the coordinates (£,?/), with y = 
y (t,x) given by assumption (Al). For convenience, we denote by x = x(t,y) the 
inverse of y (t,x). 

Actually assumption (Al) ensures that for each t G [0,T] 

y(t,Qi(t)) = Qi(0), x(t,Qi( 0)) = Qi(t ), i = m,g, 



y(t,ri(t)) = r*(0), x(t,ri(0)) = r i(t), i = m,g. 

Moreover we put 

0(t, y) := 0(t, x(t, 2 /)), u(t, y) := u(t, x(t, y)), 

and define a time-independent multivalued function A := A(y,r) on x R by 
putting 

( c s r if ye fi m (0), r < 0, 

J [°- L ] if y e n m (0), r = 0, 



A(y,r) := 



c e r + L if y £ fl m (0), r > 0, 



^ c g r if y G n s (0). 

Now, let {u, 6} be a weak solution of (1.5)— (1.9) and define u and 6 as above. 
Then it follows from (3.1) that 

u(t,y) <E A(y,6(t,y)) for a.e. (t,y) € Q. (3.2) 



Moreover the inverse B := B(y,q) of q := A(y,r ) is a singlevalued function given 
by 

— if y € fl m (0), q < 0, 

C-s 

0 if y e n m (0), q e [0,L], 

- — ~ if ye 9m(0), q > L, ( 3 ‘ 3 ) 

— if y e fi s (0). 

C 9 

Hereafter we use the abridged forms A(6) and B(u) for A(-,0) and B( •, w), respec- 
tively. Clearly A and B are bounded and maximal monotone operators in L 2 (Q) 
and A satisfies the following inequality 

(z* - Z 2 ,Z! ~ Z 2 )l*(Q.) > C\\z\ - z 2 \ 2 L 2 {q) , (3.4) 




for all Zi G L 2 ( it) and all z* G A(zi) for i — 1,2, where C\ is a positive constant. 
Also, we see that the graphs of A and B in L 2 (Q) x L 2 ( Q) are strongly- weakly 
closed, that is, A and B are demi-closed. As a remark, we can not identify V = 
H 1 (Q) with the space transformed by y = y(t,x) in the strict sense, although V 
is invariant under this coordinate transformation. Nevertheless, for simplicity, we 




Transmission- Stefan Problems Arising in Czochralski Process 



159 



use the same notation V = H 1 (Q) to indicate the space given after transformation, 
using the symbol x and y separately. 

Now for each t G [0, T], we denote by J x(t) := J x ( t ){y) (resp. J yW := J y ( t )(x)) 
the Jacobian matrix of the transformation x = x(£, y) (resp. y = y (£, x)). Also, we 
denote by x = x(£, y) the restriction of x = x(£, y) to the boundary T. Further, we 
denote by dx = \\J x ^(y)\\dy, dT(x) = \\J^ t )(y)\\dT(y) the relations between the 
volume elements dx and dy in fi, and the surface elements dT(x) and dT(y) on T, 
respectively. Actually | |«/ x (t) ( 2 /) 1 1 ls the determinant of J x (t)(y) an d \\Jst(t)(y)\\ is 
similarly expressed by using a suitable 2-dimensional local coordinate system on 
T. Under this setting, (2.8) is transformed to the following form: 

[ (d f (t),£(t))v*ydt + f (K(t,d(t)),£(t))v*ydt + ( (G(t,u(t))^(t))v* ydt 
Jo Jo Jo 



= [ 2 (f*(tU(t))v\vdt for all £ G T 2 (0, T; U), 

Jo 

subject to the initial condition 

u(0,y) = u 0 (y) for a.e. y G fi, 

where for each z\ G V the operators K(t,-) :V — > V* is defined by 

/* ^ Q — 

( K(t,z0,z)v,v ■= J { k ( 2 ^ m ( 0 ) + k g x^} aijiO^^dy 



(3.5) 

(3.6) 



,Q ij = i 



~ J2 { fc ( fl )Xn ra ( 0 ) + k 9Xn^} 

J JkxWlI 



+ n0 / r ^||/ x(t) || 



d Zl 9||J xW || Z 

dyj dyi ||Jx(t)|| 

dF(y) for all z e V, 



dy (3.7) 



and for each z 2 G L 2 (Q) the operators G(t , •) : L 2 (Q) — ► V* is defined by 

{G(t,z 2 ),z))v,v := - / z 2 (w(f) ■ V y z)dy 
Jn 

+ jU (_ ' 9 H^ t )H +w(t) ■ V y (||J x(t )|l)) n / u dy for all z € V. 



in \ dt 
Moreover /* € L 2 (0, T:V*) is given by 



l|Jx(t)ll 



(f*(t),z)v*y := [ f(t)zdy+ [ p(t)zdT(y) for all z E V, 
Jn Jr 

\\Jm(v)\\ 



f(t,y ) := f(t,x(t,y)), p{t,y) := p(t,x(t,y)) 



ll^( t )(y)l| ! 



aij(t,y) := ^2 ^ L ( t » x (*>?/))^ Z -( < > x ( t >2/)) for FJ = 1 > 2 > 3 > 



<% 



h=l 



dx h 



dx h 



with 
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w := (wi,W 2 ,Wz) with 

w j(t,y) '■= (-^f(t,y)+Vi(t,x(t,y))\ for all (t,y) e Q. 

i = 1 ' ' 

In fact, perform the coordinate transformation from (t,x) to (t,y) in (2.3) and 
take r](t, x(£, i/)) 1 1 J x (t)(y)|| as test function £. Then we have (3.5). 

As regards the operators K and G, we have the following lemma. 

Lemma 3.1. (i) If z n — > z weakly in V and t n — > t as n — > +oo, X(£ n , z n ) — > 
K(t,z) weakly in F*. Moreover, there exist positive constants Ci,i — 2,3,4, such 
that 

\K(t,zi)\y* < ^l^ilv for all zi G F and t G [0,T], (3.8) 

(K(t,zi),zi) v .y > Ci\zi\y — C' 4 | 5 i||_ 2 (rj) for all zxGV and t G [0,T]. (3.9) 

(ii) If v n weakly in L 2 (0,T;V) and in L 2 (0, T;L 2 (Cl)) as n — > +oo, then 
K(-,v n ) — > if(-,v) weakly in L 2 (0,T; F*). 

(in) If z n — > z weakly in L 2 (Q) and t n — > t as n — > +oo, G?(£ n , z n ) — > G(t, z) 
weakly in F* . Moreover, there exists a positive constant C 5 s?xc/i that 

\G(t,z 2 )\v* < C 5 \z 2 \ L 2 { ^ for all z 2 G P 2 (fi) and f € [0,T]. (3.10) 



Proo/. Putting k(r) := / Q r k(s)ds, we reformulate the definition (3.7) of K{t,z\) 
as follows. For each t, € [0, T] and each 2 , G V, 



{K{t,Zl),z) V *y 



/ FI a ij (0 

do m (o ) ij=1 

3 

— / ^ ' a ij(t) 

jQ rn{0) ij = 1 



dk(z\) dz 
dyj dyi 



dy+ kg^aijft) 
Jnjo) n~. 



■j ^ an 9 ( o) i; , =1 

afc(fl)a|kx( t )|| I 



dy, dyi 



dy 



f 3 

/ kg a ij{t) 

i,j=i 



dyj dyi ||J x(t) || 
9||J x ( t )|| z 



dy 



dyj dyi ||J xW || 



dy + no J z lZ \^kdY 



u 



x (0 i 



for all z G F. If z n — > z weakly in F, t n — > t as n — > +oo, then z n — > z in L 2 (Q) 
by the compact imbedding of F into L 2 (fl). Also, we have k(z n ) — > fc(z) weakly in 
F by virtue of the definition of k. This implies that K(t n , z n ) — > K(t, z) weakly in 
F*. It is easy to deduce (3.8) and (3.9) from (3.7) and the definition of k. Similarly, 
(ii) is obtained from the expression of K. Finally ,(iii) is also inferred from the 
definition G. □ 



The system {(3.5), (3.6)} is referred to as (P) and in the next section its 
solvability is discussed. 
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4. Construction of a Weak Solution 

In this section we employ the time discretization method in order to solve (P). 
For each fixed N E N, we put h := T/N, tf := ih for i = 0, 1, . . . , N, 9q := 
0 O , Uq := u 0 , and derive the following problem. For each i, find the pair of functions 
{Oi+iiUi+i}, with 0^ E V, G L 2 (fl) and G 4(0^), such that the 
following equation holds: 

( 1+1 . “ + (<5(^ 1 ,U^. 1 ),z)y. j V 

\ n / L 2 (n ) ~ AT 

= (fi for all z £ V, (4.1) 

that is, 

^ + K{t? +1 ,0? +1 ) + G(t? +1 ,u» i) = ^ in V*. (4.2) 

Here is the discrete approximation of / defined by 

t N 

f*+x := 1 f* (s)ds for * = 0, 1, . . . , N - 1. 

The following lemma ensures that the above difference approximation scheme 
{9^u^}fLi can t> e i n f ac t constructed. 

Lemma 4.1. Given 9 ^ G V and G L 2 (Q), i < N — 1, there exist 9^ +1 G V and 
1 G L 2 (Q) with uf +1 G A(0^ 1 ) such that 9^_ ± and uf^ satisfy the equation 
(4.2) ( hence (4.1)). 

Proof. Fix any t G [0,T]. We define a multivalued operator 4> : V — > F* by 

:= jfjl + jr(t,i) + G(t,Ii) € H*;2i G i(z)} for all z £ V. 

Then we see from Lemma 3.1 that for each z E V, $z is a non-empty bounded 
and closed convex set in F*, since A is maximal monotone and bounded in L 2 (Q). 
Also, by Lemma 3.1, its graph is weakly closed in V x I/*. The boundedness and 
coerciveness of are immediately obtained. Thus (4.2) is solved by virtue of the 
abstract results on the surjectiveness for nonlinear multivalued operators of Type 
M (see Brezis [1], Kenmochi [9]). □ 



Now, let {u^, 0^ v }£Lo constructed as in Lemma 4.1, and define the piece- 
wise linear function u N on Q generated by on [0,T], that is, 

[ u?(y) 



u N (t,y) := 



t-t. 



N 

i - 1 -N 



if t = t* ,i = 0, 1, . . . , TV, 
Ui-i(v) + ( 1 - A 1 ) u?(y) 



h 



if t £ = 1,2, 



,N. 
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It is clear that u N G C([0, T]; L 2 (Q )) for each N G N. Also, we can define the step 
functions § N , K N , u N and Q N on [0,T) by 

d N (t):=e», K N (t):=K(t?,0 f) if t G [tj^.t?), * = 1,2, . . . ,N, 

n N (t) := uf, G N (t) ~G{tf ,uf) ifte[t^_ 1 ,t^),i = l,2,...,N. 

The properties of these sequences of functions are stated in the following lemmas. 



Lemma 4.2. There exists a positive constant Cq such that 



|^| L 2 (n ) <C 6 for all N eN and i = 0,l,..., N. (4.3) 

Moreover, there exist positive constants Ci,i — 7,8, such that 

\e N \mo ,T;V) < C 7 for all N e N, (4.4) 

|fi iV |c([o,T];L 2 (fi)) < Cg for all N € N. (4.5) 



Proof. By taking 0^ as z in (4.1), we get 

{uf - + h{K{t?,0?),9?)v,v + h(G(t? ,nf ),0?)v,v 



= h(f* N ,6?) v .y for i = 1,2,..., IV. (4.6) 

From the definition of B, there exists a convex and Lipschitz continuous function 
b on L 2 (Q) such that its subdifferential, denoted by db , coincides with B in L 2 (£l). 
Then we have 

(«f - )L 2 (Q) > b(uf) - &«!), (4.7) 

G^uf li 2 (n) — Cs — b(uf) < Cio\uf |^2 (q) + C 1 0 , (4-8) 

since 6^ = B(uf) = db(uf) for i = 0, 1, . . . , TV, where C{ and C[ for i = 9, 10, are 
positive constants independent of N. On the other hand, by virtue of Lemma 3.1, 
there exist positive constants C' 3 and C' 4 depending only on C3, C4, w, and b such 
that 

+ G(t?,u?),0?)yy > C'\en 2 v - C'M^hyiy ( 4 - 9 ) 

Substituting (4.7)-(4.9) into (4.6) and summing up the results from i — 1 to 
i = j < N, we obtain 



b(uf) + (C' - e)h^2 \0?\ 2 v < ^h^biu?) + C e hJ2\fi 



i= 1 



c 9 



p*N |2 
i \V* 



+ 



2=1 



2=1 



C4C9 

Gq 



+ b(u 0 ), 



for any 5 G (0, C'3) and C e := 1/(46:). Note here that all constants are independent 
of f,Uo, and N. Therefore, applying the Gronwall lemma to this inequality, we 
derive the estimates (4.3)-(4.5). □ 



Lemma 4.3. {u N }^ =1 is equicontinuous as a family of functions from [0 ,T] into 

V*. 
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Proof. For each s,t with s < t such that s G [t™_ 1 ,t™],t G and s' <t' , 

with the help of the definition of u N (-) we have 



!«*(*) -u N (s)\ v . < 2j2 h {\K( t ?’9?)\v* + \G{t?,u?)\ v *} + ^2h\f* N \ 






+ I ^2h\G(t^ ,U^)\y, + | YMf* N \l 



Using Lemmas 3.1 and 4.2, we see that 
\u N (t) - u N (s)\v* 

< 2^/\t - s\ + 2h {C2 \& n \l 2 (o,t-,v) + C$\u N \ C ([o,T]-,L 2 {n)) + |/*|l 2 (o,t ; v*)} 

< 2 ^/| t — s\ + 2h IC 2 C 7 + C$C$T 2 + |/*|l 2 (o,t ; v*)| > 

which shows the desired conclusion. □ 



By virtue of Lemmas 4.2, 4.3, and the Ascoli-Arzela’s theorem it is easily 
seen that {u N }^ =1 is relatively compact in C([0, T]; U*), so that there exists a 
subsequence {N m } C {N} with AT m — > 00 as m — > 00 and there exists a function 
u G C([0,T]; V*) such that 

u N ™^u in C([0,T];U*), and hence U N ™ -> u in L°°(0,T; V*). (4.10) 

Making use of the convergence (4.19), we get the convergence of {Q Nrn }™ = ± 
as follows. 



Lemma 4.4. There exists a function 0 G L 2 (0, T; L 2 (f2)) such that 
0 N m i n L 2 (0,T;L 2 (fl)) as m —> + 00 , 

and the pair of {#, u} satisfies that u G A(0). 

Proof Since {0 Nm }^ =1 is bounded in L 2 (0, T; L 2 (£l)) by (4.4), it follows from 
(4.10) and (3.4) that 

0 = lim [ T {U N ™(t) - ti N ™'(t), 0 N ™(t) - 0 N ™'(t)) v *ydt 

777,777'— t-OO J q ’ 

> lim Cj \§ Nm {t) -d Nm '{t)\ 2 L 2 {a) dt. 

777,777'— >-\-oo j q v J 

This implies that {0 Nrn }^ =1 is a Cauchy sequence in L 2 (0,T; L 2 (fl)), so that 
0 Nrn converges in L 2 (0, T; L 2 (Q)) to some function 0. Also, the estimate (4.5) and 
the convergence (4.10) imply that u Nm converges weakly in L 2 (0, T; L 2 (tt)) to 
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u as m — ► +oo. This fact, together with the demi-closedness of A, shows that 



u G A(6). □ 

Proof of Theorem 2.1. We have seen above that 6 G L 2 (0, T; U), 
u G C([0,T];U*) nL°°(0,T;L 2 (fi)) and 

q in L 2 (0,T; L 2 (fi)) and weakly in L 2 (0,T;U), (4.11) 

u Nm ^ u in L°°(0,T; V*) and weakly- * in L°°(0, T; L 2 (£l)), (4.12) 

as m — * -boo. Now (4.2) implies that 

^i2 Nm (£) + K Nrn (t) + G Nrn {t) = j* Nrn {t) in U* a.e. t G (0,T), (4.13) 

where (£) := for each t G ) and i — 1,2,..., Ar m . Since {K Nrri } 

is bounded in L 2 (0,T; U*), we may assume that 

K Nm -► K weakly in L 2 { 0, T; V*) asrrw+oo (4.14) 



for some function A' £ L 2 (0, T; V*). By the expression (3.4) of A' it is easy to see 
that 

\R Nm (t) - k(t,0 Nm (t))\v < Cnh\6 Nm {t)\ v for all t £ [0,T], 

where C\\ is a positive constant independent of N. Noting this together with 
(4.14), we see from (ii) of Lemma 3.1 that K = K (-,§), since K(',0 Nrn ) — > K(-,6) 
weakly in L 2 (0 , T;U*). Similarly, by (4.10), (4.12) and (iii) of Lemma 3.1, we 
have 

G Nrn — > G(-,u ) weakly in L 2 (0,T;U*) as ra — ► -boo. (4.15) 

Now, it follows from (4.13) with (4.14) and (4.15) that 

^rU Nrn ~ru weakly in L 2 (0,T; V*) as m — » -boo. 

Lib Lib 

Furthermore, letting m — > -boo in (4.13), we see that 

f t u(t) + K(t,0{t)) + G(t,u(t)) = f*{t) inV* for a.e. t € [0, T], 

that is, {#, u) satisfies (3.2) as well as (3.3). Therefore, since u G A(0), {0,u} is a 
solution of (P). Finally we write this solution as the function of (t,x) by putting 

0(t,x) 0(t, y(t,x)), and u(t,x) := u(t,y(t,x)). 

Then we see that {6,u} satisfies (Dl) and (D2). □ 
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Quasi-static Melting of Crystals: 
Experiments and Analysis 

Martin E. Glicksman, Afina Lupulescu, and Matthew B. Koss 

Abstract. The kinetics of melting pivalic acid (PVA) dendrites was observed 
under convection-free conditions on STS-87 as part of the United States Mi- 
crogravity Payload Mission (USMP-4) flown on Columbia in 1997 during STS- 
87. The analysis of video data show that PVA dendrites melt without relative 
motion with respect to the quiescent melt phase. Dendritic fragments dis- 
play shrinking to extinction, with fragmentation occurring at higher initial 
supercoolings. Individual fragments follow a characteristic time-dependence 
derived elsewhere. The microgravity melting kinetics against which the experi- 
mental observations are compared is based on conduction-limited quasi-static 
melting under shape-preserving conditions. Agreement between the analytic 
theory and our experiments is found when the melting process occurs under 
shape-preserving conditions as measured using the C/A ratio of individual 
needle- like crystal fragments. 



1. Introduction 

1.1. Isothermal Dendritic Growth Experiment (IDGE) 

The thermodynamic driving force for crystallization in the case of a pure substance 
is the melt supercooling, AT = T m — T^, where T m is the melting point of the ma- 
terial, and T^ is the temperature of the melt established far from the heat emitting 
dendritic solid-liquid interface. The crystallization process for free growth requires 
that the latent heat released be transported away from the solid-liquid interface by 
thermal conduction to the cooler (supercooled) melt. Theories [5]-[14] of dendritic 
growth in pure materials are best tested under conditions of conduction-controlled 
heat transfer, a situation possible only where gravity is reduced to nearly zero, 
where buoyancy-induced natural convection is virtually eliminated [3] -[9]. The 
Isothermal Dendritic Growth Experiment, or IDGE, provides basic terrestrial and 
microgravity data on the kinetics of dendritic solidification. 

The IDGE flew in near-Earth orbit aboard Columbia three times, and proved 
that Ivantsov’s conduction solution [9] describes steady-state dendritic growth for 
the test materials succinonitrile (SCN) and pivalic acid (PVA). These cubic or- 
ganic crystals are 1) transparent, 2) easily purified, and 3) have conveniently low 
melting points. IDGE instruments, on board the STS-87, provided CCD images 
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Figure 1. Video sequence (frame height ~ 1.5 cm) showing the 
progressive melting of PVA dendrites in microgravity. As the den- 
drites fragment, individual crystallites remain motionless within 
the melt. Video data were analyzed frame-by-frame for the melt- 
ing of the needle crystal shown in the second panel and reported 
in section 5. 

(telemetered to Earth during each flight), NASA-processed 35-mm film negatives 
(available postflight), and, for the first time, real-time streaming of 30 frame per 
second (fps) full gray-scale video data telemetered via the K-band (high-frequency) 
antenna. 

1.2. Mushy zones 

A mushy zone is defined as a two-phase region, consisting of the dendritic solid 
and its melt phase. Mushy zones occur in ingots, castings, and welds, [16], [18]. We 
studied 4-9’s purity PVA under nearly gravity- free conditions (mean quasi-static 
acceleration of 10 -7 po, where go = 9.8 m/s 2 is the normal terrestrial gravitational 
acceleration). The melting of a mushy zone in a pure substance under convection- 
free conditions should be kinetically controlled by conduction-limited heat transfer, 
provided that convection is absent. Convection, if present, alters the thermal fields 
during melting, because sedimentation occurs between the solid and the melt, 
inducing relative motion between the phases. 



2. Video data 

The freezing and melting observations analyzed and reported here were recorded 
as 30 fps video downlinked from microgravity experiments conducted in low-earth 
orbit during November-December 1997. On-board video signals generated within 
the IDGE were telemetered to Earth via Columbia ’s K-band antenna, and recorded 
by NASA using a high-resolution VTR on 8-mm tapes. Two hours of 30 fps video 
frames were transferred onto a single image buffer file as 640x480, 255 gray-scale, 
digitized images. For each dendritic freezing/melting cycle, the video data were 
exported as individual .tif files for every tenth frame. 

For the last minute of melting, where the thermal fields in the IDGE ther- 
mostat remain at a stable temperature, approximately 1.8K above T m , individual 
.tif files were exported for every video frame. When melting begins, the dendritic 
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Video Frame: Time = 33 




Figure 2. Sample digitized video frame for image analysis of a 
melting PVA dendritic fragment. The image analyzer calculates 
the equivalent ellipse for the fragment, allowing determination at 
each time of its major axis length, C(t), and its C/A ratio. 



side-arms first shorten, and then larger secondary branches detach from the pri- 
mary stem. Figure 1 shows several video frames representative of a few stages 
of the total melting sequence. In microgravity, it is important to note that the 
individual crystalline elements of the mushy zone remain motionless. 



3. Data Analysis 

3.1. Image processing 

The shapes of the melting dendritic fragments were determined using commercial 
image analysis software. Figure 2 shows the shape of a melting dendrite stem out- 
lined using this software tool. The equivalent ellipse of the two-dimensional image 
of this fragment is calculated on the basis of its major axis, C, and minor axis, A. 
The particles are of course three-dimensional objects approximated here as smooth 
prolate spheroids of decreasing size. Image analysis conducted on sequential video 
frames allowed the measurement of the major and minor axes of the equivalent 
ellipsoidal fragment as a function of time. 



4. Theory 

Quasi-static and moving boundary solutions are well known [19] [2] for the growth 
of spherical crystals for which ( C/A = 1). For the more general cases of freezing 
ellipsoids and hyperboloids, moving boundary solutions were developed by Ham 
[7]. Ham’s similarity solutions for ellipsoids apply to the case of crystal growth, 
not melting. We developed and published elsewhere a quasi-static model for the 
conduction melting of prolate spheroids [4] . 
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The quasi-static melting or freezing of a crystal is based on determining the 
heat current entering or leaving its static solid- melt interface, and then formulat- 
ing kinematic equations for the crystal’s shrinkage or growth. The rate of change 
of the particle’s volume is assumed to occur slowly enough so that the surrounding 
transport field has sufficient time to adjust to the changing particle. The thermal 
transport field used only approximates the time-dependent solution to the con- 
duction equation as a quasi-static solution based on Laplace’s equation. The key 
condition determining the applicability of such approximations is that the Stefan 
number — the intensity measure for melting or freezing — is small, i.e., St 1. 

The axis-symmetric temperature field developed around a melting particle, 
T(£,r/), is non-dimensionalized as the potential, $, defined as 



(£.»?) 



)-T n 

Too-T m 



(1) 



where is the temperature of the melt infinitely far away from the particle. Here 
the three-dimensional coordinate system chosen is the separable prolate spheroidal 
system (£,rj,(p). These coordinates correspond to isotherms as (£ = const.), and 
to heat flow surfaces as (rj = const). The crystal-melt interface is defined as the 
isotherm T(E,rj = T m ), which defines the prolate spheroidal interface £ = E. 

The rate of melting (or freezing) is determined by the sign and magnitude of 
the denominator in Eq.(l). A dimensionless parameter based on that temperature 
difference is the Stefan number, defined here as 



T —T 

± OG - L m 

= &H f /cr 



( 2 ) 



where A Hf is the molar enthalpy of fusion, and is the molar specific heat of 
the melt at constant pressure. Specifically, the Laplacian potential surrounding a 
prolate spheroid with the fixed interfacial potential $(S) = 0, embedded in a melt 
where the far-field potential $(oo) = 1, is given in these curvilinear-coordinates as 



= 1 - 



logfrj 
log fry' 



( 3 ) 



A kinetic expression has been found in [4] for quasi-static melting or freezing 
of prolate spheroids, namely 



C(t) = 



cl 



1 6 aeSt 



Slog I 

where C/A = H/-\/£ 2 — 1. Equation (4) may be non-dimensionalized as 



Co 



( 4 ) 



\/l - Kprol ■ St ■ Fo , 



( 5 ) 
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Time [s] 

Figure 3. Melting kinetics of a dendritic pivalic acid (PVA) crys- 
tal fragment (C/A = 12) in microgravity. Experimental melting 
data (open symbols) obtained through digital analysis of 230 dig- 
itized video frames measure C(t) versus time. The solid curve is 
the theoretical prediction from Eq.(4) fitted with a Stefan num- 
ber equal to 0.057. The fit with quasi-static theory is based on 
the observed extinction time C = 40s. 

where the kinetic coefficient, K pro i , is defined as 

and the Fourier number (dimensionless transformation time) is defined as 

Fo=^t. ( 7 ) 



5. Results 

5.1. Melting rates 

We analyzed two PVA crystals melting in microgravity, each formed originally as a 
dendrite created in separate IDGE growth cycles. These dendrites grew from pure 
molten pivalic acid that was supercooled 0.400K and 0.42 IK, respectively. The 
mushy zones consisted of approximately 4% solid when melting was initiated. The 
time needed to melt this mushy zone completely was approximately 12 minutes. 
The last large fragments to melt were selected for quantitative analysis. These 
fragments had initial lengths of Co = 0.760cm and 0.695cm, respectively. 

To calculate the melting kinetics of these fragments one requires the C/A 
ratio, the thermal diffusivity of the melt, and the Stefan number imposed by the 
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Figure 4. Melting kinetics of a dendritic pivalic acid (PVA) crys- 
tal fragment (AT = 0A21K) in microgravity. 

melting cycle. The thermistors arranged within the IDGE thermostat were op- 
timized for measuring the supercooling prior to dendritic growth quite precisely 
(±0.002K). The subsequent temperature history within the growth chamber, espe- 
cially during the melting portion of an experimental growth cycle, is, unfortunately, 
less certain, because temperature uncertainties arose from the lack of synchroniza- 
tion between the video clock and the mission clock. In view of this situation, one 
may instead adopt a self-consistent method for estimating the Stefan number in 
the melt during each melting sequence. The basis of the value of the self-consistent 
Stefan number is found from the C/A ratio (which was equal to 12 for the entire 
melt cycle), the initial length, Co = 0.760, and the length C(t) at a later time £, 
at which the fragment has decreased due to melting. In this instance the process 
of melting was followed to extinction, so C(t) = 0, and t = 40s. 

The crystal melting data reported in Figure 3 were fit in their entirety using 
the self-consistent Stefan number, which corresponds to the melt being superheated 
0.63K above its equilibrium melting point, T m = 35.97 K. The melting kinetics 
predicted over the entire the lifetime of this dendrite fragment using eq.(4) is in 
good agreement with experiment. The video data show only small disparities from 
the theoretical predictions. 

The second crystal analyzed, initially formed at a supercooling of 0.421K, 
did not melt with a constant C /A ratio. These melting data were “sectorized” into 
5 sequential melting periods of 10 seconds each, except for the last sector, which 
was 4 seconds long. Sectorization allowed calculation of the Cj A ratio, which was 
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changing in this case. The same process of selecting a Stefan number was used, 
except each sector of melting was independently calculated using the initial and 
final lengths for that sector. Figure 4 shows the observed data and the theoretical 
predictions using the sectorized approach described here. 



6. Conclusions 

1 . Dendritic mushy zones were formed in supercooled high-purity P VA melts 
in microgravity. At the end of each experimental cycle, the mushy zone was 
progressively melted by raising the temperature of the thermostat. Video 
data at 30 fps of the melting process for one cycle (0.4K supercooling) 
are reported here. Upon raising the temperature, the mushy zone melts 
into numerous fragments, many of which may be characterized as prolate 
spheroids. 

2. Individual dendritic fragments remain motionless in the low-gravity envi- 
ronment, progressively melting toward extinction at nearly constant Cj A 
ratios. 

3. Moving boundary solutions, such as those proposed by Ham for ellipsoidal 
crystals, are not applicable to the melting process. The theory describes 
the kinetics of conduction melting of prolate spheroids of arbitrary C/A 
ratios. 

4. The quasi-static analytic results are limited to small Stefan numbers (St <C 
!)• 

5. The observed melting kinetics in microgravity of two prolate spheroidal 
crystals with differing aspect ratios were found to be well described by the 
analysis. 

6. The Stefan number required to compare theory and experiment was esti- 
mated using the time of melting and the observed lengths at the start and 
end of the melting. 
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A Reduced Model for Simulating Grain Growth 

Reiner Henseler, Barbara Niethammer and Felix Otto 

Abstract. We introduce a reduced model to study normal grain growth in 
two dimensions, which is based on the gradient flow structure of the mean 
curvature flow. In the simplified model, we restrict this gradient flow to grain 
boundaries which are straight lines. We present simulations for a large number 
of grains and we also study the effect of anisotropies. 



1. Introduction 

Many materials have a polycrystalline structure which consists of a large number 
of single crystals, or grains, with different crystallographic orientations. In the last 
stage of the evolution, the grain boundaries move under capillary forces to reduce 
their surface energy and, as a result, the grain structure coarsens. 

It is of fundamental importance for technological purposes to predict the 
time dependence of the characteristic scale of inhomogeneity, given for example by 
the average grain radius. What is often observed in experiments is the so called 
normal grain growth, which means that materials with different histories develop 
a certain universal large-time behavior. In particular, it implies power growth laws 
for the average grain size and self-similarity of the distribution of grain sizes and 
topological class (the number of neighbors). 

In modelling grain growth, attention has mainly focused on two-dimensional 
grain growth, as it appears for example in thin films and foils, due to the sim- 
pler geometry and easier access to experimental data. The mean curvature flow 
coupled with equilibrium of forces at the triple junctions is widely accepted as 
an appropriate model for the evolution of the grain boundaries. To explore nor- 
mal grain growth by simulations, one would need to simulate this complex motion 
for thousands of grains over a large time window. Due to this complexity, there 
is a large interest in deriving reduced models which allow for efficient numerical 
simulations. 

The so called mean-field models are mostly based on the von Neumann- 
Mullins law, which gives an exact relation between the growth rate of the area of 
a grain and its topological class. Based on this relation, mean-field models for the 
distribution of grain sizes and topological class have been suggested and analyzed 
(see e.g. [Fra88, FU94, FUK88] and the references therein). However, a significant 
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disadvantage of this approach is that the von Neumann-Mullins relation only holds 
in two dimensions and if all possible anisotropies are neglected. 

In this paper we investigate a different approach, which is based on the fact 
that the mean curvature flow coupled with equilibrium of forces at the triple 
junctions has an interpretation as a gradient flow. It is the gradient flow of the 
surface energy with respect to the L 2 norm for the normal velocities of the grain 
boundaries. Our strategy is to restrict this gradient flow to grain boundaries which 
are just straight lines. Thus, we can parametrize the grain network by the vertices 
and their neighbor relations, which significantly reduces the computational effort. 
On the other hand, it is conceptually simple to introduce anisotropic boundary 
energies and mobilities. A similar approach has been explored by a Japanese group 
and is known in the applied literature as “vertex models” [KNN89, NFK94]. 

In Section 2 we explain in detail our reduced model, while in Section 3 we 
present the first results of simulations for an isotropic surface energy for Voronoi 
data with 20000 vertices. We also consider the case of anisotropic surface energies, 
where the energy depends on the relative orientation of the neighboring grains. 

For more details on the physical background and references to the applied 
literature we refer to the excellent review article [FU94]. 



2. The model 

2.1. The gradient flow structure of the basic model 

In the sequel, we consider models for grain growth in two space dimensions. 
Thus, our objects are networks of curves which meet in triple junctions. Since we 
are not interested in the influence of boundary conditions, we consider one-periodic 
networks. In the sequel, we denote by 7 k,k = 1 ,...,M, the boundaries, or edges, 
of the grains, i.e. the smooth curves with endpoints in triple junctions, which 
will be denoted by = 1, ...,L. The system of grain boundaries is denoted by 

T\ I l-^f 

F U/c=i 7fc- 

We first describe how the mean curvature flow coupled with equilibrium of 
forces at the triple junctions has a natural interpretation as a gradient flow (see 
also [TC94] for the case without triple junctions). In the abstract framework we 
have a differentiable manifold A4, and a vector field /, which attaches a tangent 
vector f(x) E T X M to every point x E M. The vector field / defines a dynamical 
system x = f(x). A gradient flow is a dynamical system where / is the negative 
gradient —grad E of a function E on M. The notion of a gradient requires a 
Riemannian structure, that is, a metric tensor g on M. Then, the extended version 
of x = — gradF^ is 

9x(t) y) + (di 8E x ( t ) , y) = 0 for all y e T xW M and t 

and we find for y = x(t) that the value of E decreases along trajectories. 

In the present setting, the manifold M is just given by all the one-periodic 
networks T consisting of smooth curves which meet in triple junctions. The tangent 
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space TyM of a point T E M can be conveniently characterized by all admissible 
normal velocities v of the curves. Notice that we require the network to remain 
connected. Thus the normal velocities of the edges meeting in a vertex are not 
independent. The velocity Xi of a vertex can then be expressed by the normal 
velocities of the incoming edges. 

For the metric tensor we take in the following 

9r(v,v) := / vvds = 2_] / vv ds ^v,v G TyM. 

J rn(o,i ) 2 A*n(o,i ) 2 

Here, we assume that the mobilities of the grain boundaries are all equal and nor- 
malized to one. Otherwise, the inverse mobility would show up as a weight function 
in the scalar product. With this choice of g, we have also implicitly assumed that 
the mobility of the vertices is much larger than the mobility of the boundaries, 
such that the motion is only limited by boundary kinetics. The vertices are then 
just geometrical objects and their motion is fully defined by the motions of the 
adjoining boundaries. 

The energy of the system is just the surface energy. In this paper we restrict 
our attention to the case in which the surface tension is constant on each edge 
and may depend on the relative orientation of the neighboring grains. Thus, the 
energy is given by 

r M 

E(T) := / <r(a)ds = 'y\a{a k ) | 7 fc| 

Jr k = i 

where a denotes the surface tension, which may depend on the “misorientation” 
angle a. 

The first variation of the energy in the direction of an element v of the tangent 
space can be easily computed (see e.g. [KL01]): 

M r 

(diffi?, v) = YV(a fc ) / kv + X k ■ (T{a k )b k \X\. 

A*n(o,i) 2 

Here, i — 1,2 denote the endpoints of jk and bk its tangent. 

Hence the gradient flow is given by 

v = —a(oLk)K on each 7 *. ( 1 ) 

where k is the mean curvature of the edge, and at the triple point there holds 
mechanical equilibrium 

= o» 

ji 

where the sum extends over the three edges meeting in Xi and the bj i point in 
the direction of X{. If cr is independent of a, this just means that the curves meet 
in a 120 degree angle, otherwise it implies Young’s law that the ratios of surface 
energies and the sin of the opposite angle are equal. 




180 



R. Henseler, B. Niethammer and F. Otto 



This evolution is well-defined [BR93, KL01] until two or more vertices collide, 
after which topological rearrangements may take place. This happens when either 
an edge or a whole grain vanishes. In the first case an unstable fourfold vertex 
is produced, which immediately splits up again, usually in such a way that two 
new vertices are connected by a new edge. In this case, two neighboring grains 
decrease their topological class, whereas the two other grains increase it. When a 
three-sided grain vanishes, all neighboring grains loose an edge and thus decrease 
their topological class. In principle, grains with more than three edges can vanish. 
For the numerical simulations, we assume that this unlikely situation does not 
occur and a neighbor switching takes place before a grain with more than three 
edges becomes small. 

We recall that (1) implies for dimensional reasons that, if the typical length 
scale shows a scaling behavior, it must grow proportional to t 1 / 2 and consequently 
the energy density decreases as t~ 1 / 2 . 



2.2. Restricted gradient flow 

Our aim is now to simplify the dynamical system described in the previous section 
in such a way that simulations of large systems become feasible while the essential 
properties of the dynamics are still kept. In grain growth, one usually observes that 
the grain boundaries do not have large curvatures. This motivates a simplification 
which restricts the dynamics to networks whose curves are just straight lines. Now 
the advantage of the gradient flow perspective becomes apparent. A gradient flow 
can be naturally restricted to a lower dimensional submanifold M C M. Both 
the energy and the metric tensor restrict to Af in a canonical way. Consequently, 
we restrict the gradient flow to the submanifold of networks where vertices are 
connected by straight lines. Then, this submanifold is completely characterized by 
the vertices, together with the information on which of these vertices are connected: 

M = {X = {Xi} i & neighbor relations} . 

The tangent space TxN in a point X £ Af can then be described by the velocities 
of the vertices 



T x M={V = {V i ) i }. 

In the sequel, we will denote by 7^ the edge, which connects the vertices Xi and 
Xj. It is parametrized by 

7 ij = s Xi + (1 — 5) Xj, s G [0, 1] , 

and its normal velocity is given by 

Vij = ( sVi + (1 - s) Vj) • Uij , 




A Reduced Model for Simulating Grain Growth 



181 



where is the normal to 7y . With the definition Y, :h x ] conn.toX t 

the metric tensor can be easily computed as 



9 x{v,v) = \Y,Y^\nn\f 

1 Ji 



\vij\ 2 ds 



= ^ EE ^ l7«l [(V<-ny) 2 + (Vi-riij) (VyrUj) + (Vj-n^f 

i ji 



1 



def 



VAV 



where the elements of the matrix A are given by 

{ 3 I Tim | ^ ~ j 

rrii 

^ l7ij| nij®fiij : X* connected to Xj 

0 : otherwise 

Now we turn to the energy, which is given by 

£? w = ^EE <r «i^i» 

i ji 

where <7^ denotes the surface tension of the edge 7^ and the gradient is just 

(y E {X)) l = Y.^—y Tp- 

ji lJ 

Thus, in each time t, we can determine the direction of steepest descent X(t). The 
restricted gradient flow is thus a solution of the linear system of equations 

X (0) - Xq 

(2) 

A (X (t)) X (t) = -VE (X (t)) Vt 

Notice that in this reduced setting there is no angle condition at the vertices 
anymore. 

We now describe how we deal with topological rearrangements in the corre- 
sponding time discretized evolution. If an edge 7^ becomes smaller than a param- 
eter e during the evolution we proceed as follows. If 7^ belongs to a triangle we 
remove the triangle and replace it by a vertex with position in the center of mass 
of the triangle (cf. Fig. 1 ). 

Otherwise we replace the small edge by a fourfold vertex Xo and introduce a 
new infinitesimally small edge (X^X^) = (Xo,Xq) with new neighbor relations 
as indicated in Fig. 2 . 

We determine the velocities Xq,Xq, for which we assume X^ = — X^, as 
the direction of steepest descent in the local configuration with Xi, . . . , X4 fixed. 
Then the energy is given by 

E(X+,Xo) = ^cl*! - X 0 + l + cr 20 |X 2 - Xo | + CT30IX3 - Xq | 

+ 04OIX4 — X+| + ctoo|Xq" — X 0 |. 
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Figure 2. Change of neighbor relations 



For general velocities in Xq,X$, denoted by V+,V~ with V + = -V~ , we find 
V£'(Xq", X 0 )(V r+ , V ) = —(Tioti - V+ — (i2ot2 ’ V — 030*3 • ^ 

- 040*4 • V" + + 0oo|^ + - V~\ 

= (0-10*1 - 020*2 - 030*3 + 040*4) • (-^ + ) + 20oo|V + | 



where t^i = 1, ... ,4 denote the tangents of the edges (Xo,Xi) pointing towards 
X{. The metric tensor is given by 

9x o (V, V) = 1 ((V+ • n0 2 + (K + • n 2 ) 2 + (V + • n 3 ) 2 + (V + • n 4 ) 2 ) . 

Then, X,t = -X" is determined as a solution V of 



\(n\ (8) ni + ni (g) ni + ni (g) ni + ni (8) ni) • V - 
o 



+ (010*1 “ 020*2 “ 030*3 + 040*4) • v + 2<Joo 



V 

W\ 



( 3 ) 



It is easy to see that for (3) to be solvable it is necessary that 

|01O*1 “ 020*2 — 030*3 + 040*4 | > 2|0oo|- 
If this condition is not satisfied, we simply set 

-^0~ = “(010*1 “ 020*2 “ 030*3 + 040*4), 
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i.e. Xq is the negative force. We also compute the analogous solution for the 
configuration without neighbor switching. We then obtain the configuration which 
minimizes the energy fastest. In most cases neighbor switching is favorable, as it 
can be expected, since otherwise the corresponding edge would not have become 
small in the first place. Finally, we update Xq = Xq + &Xq" /|-X g"| etc. for small 
5 such that the new edge is larger than e. 

We have already pointed out that one advantage of this model is that it allows 
— contrary to the mean field models presented in [FU94] — to include anisotropic 
surface energies and it can in principle be extended to three dimensional grain 
growth. The applications are by no means restricted to the cases considered here. 
It can also be easily extended to allow for surface energies which depend on the 
normal of the grain boundaries or for anisotropic boundary mobilities. These cases 
will be considered in forthcoming work. 



3. Simulations 

In the numerical algorithm we start with a periodic Voronoi polygon with Nq 
vertices. We check whether we can find an edge of length smaller than e — 10 -4 r, 
where T denotes the mean edge length. If we find a small edge, we proceed as 
described in Section 2.2, solving (3) by a Newton method. Then we solve (2) by a 
conjugate gradient algorithm and update the system according to the solution X 
of (2). The time step size At is chosen in each time step as At = O-lTmin/e | . 

3.1. Isotropic case 

We first present results for No — 20000 for the case of an isotropic surface energy, 
i.e. for (Tij = 1. Fig. 3 shows the network at several times during the evolution. 

We are mainly interested in the question whether the system evolves in the 
expected self-similar manner. To that purpose, we investigate the evolution of the 
moments of the edge length distribution which should grow proportional to t 1 / 2 . 
In Fig. 4(a) we plot 

1 M i/i 

/*»(l7l) : = (jj ’ for * = |, |,1,2,3 

k = 1 

in a log- log plot. We see clearly that after a certain transition time, all the moments 
show the expected scaling behavior before finite size effects dominate in the end. 
We plot in Fig. 4(b) the energy which decreases proportional to t -1 / 2 as expected 

In Fig. 5 we plot the edge length and topological distribution respectively, 
where we averaged over the self-similar time window hit = 10 -3 to lnt = 5 • 10~ 2 . 

3.2. Anisotropic versus isotropic case 

We now want to investigate the influence of anisotropies. Therefore, we initially 
attach to each grain a random orientation 4> 6 [0, 7r), i.e. we assume that the lattice 
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Figure 3. Evolution of network in the isotropic case 





(a) Moments of edge lengths 



(b) Energy 



Figure 4. Isotropic case, No = 20000 



structure is symmetric with respect to rotations of angle 7 r. (We obtain similar 
results for different symmetries.) Then each edge is assigned a misorientation angle 

~ 1 4*ki fykjh 

where /c*, kj denote the grains which are separated by 7 ^ . With this notation we 



choose 



= 0 + (1 - 6)—zOlij{ 7T - (Xij), 
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Figure 5. Isotropic case, No = 20000 

where 6 E [0, 1] measures the strength of the anisotropy. The factor ^ is normal- 
ization such that f* cr(a) da — 7 r. 

In the sequel, we compare the results for initial data with No = 2000 in the 
isotropic case 6 — 1 and in the anisotropic case with 6 = 2/5. 

Fig. 6 shows the evolution of the moments of the edge length distribution. 
We see that also in the anisotropic case the system reaches a self-similar regime, 
the transition period being however slightly longer. 



isotropic case 



anisotropic case 





Figure 6. Moments of edge lengths, No — 2000 

That the evolution is slower is also seen in Fig. 7, which shows how the 
number of edges evolves, in the second picture in a log- log plot. 

In Figures 8 and 9 we plot the edge length and topological distributions 
respectively. Even though the amount of data currently at hand is probably too 
small to draw any firm conclusion, we observe that the edge length distribution 
is narrower in the anisotropic case and has the peak at smaller edge lengths. The 
topological distribution seems to have a larger tail in the anisotropic case and is 
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Figure 7. Edge numbers, No = 2000 



more symmetric around the peak. All these tendencies have also been observed in 
[NFK94] for a slightly different type of anisotropy, as well as in simulations of the 
full mean curvature flow [Kin]. 



isotropic case anisotropic case 




Figure 8. Edge length distribution, No = 2000 
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On a Reaction-Diffusion System for a 
Population of Hunters and Farmers 

Danielle Hilhorst, Masayasu Mimura, and Remi Weidenfeld 



1. Introduction 



Our starting point is an article by Aoki, Shida and Shigesada [1], who present 
travelling wave solutions of a reaction-diffusion system modelling the spread of 
farmers in a region occupied by hunter-gatherers. More precisely, they consider 
three different populations, namely the original farmer population, the converted 
farmers and the hunter-gatherers. We denote their densities at position x and at 
time t by F(x, £), C(x, t ) and H(x , t) respectively. The dynamics of the populations 
is governed by the following laws: 

1. All the individuals move randomly with constant diffusion coefficient D > 
0, which is assumed to be the same for all the three groups; 

2. Hunter-gatherers who meet farmers become themselves farmers at the rate 
e > 0; 

3. The growth of the three populations is logistic, that is to say that it is 
expressed by terms of the form F( 1 — A F). 

Aoki, Shida and Shigesada [1] derive the following reaction-diffusion system 
for the population densities 



Ft = DAF -f- vpF y 


1-(F + C)/K\ 




(1.1) 


C t = DAC + r c C 


1 - (F + C)/K 


+ e(F + C)H 


(1.2) 


H t = DAH + r H H 


1 - H/l\ - e(F + C)H, 


(1.3) 



where rp^rc^H > 0 are respectively the intrinsic growth rates of the original 
farmers, the converted farmers and the hunter-gatherers, and where K > 0 and L > 
0 are the carrying capacities of the farmers and the hunter-gatherers respectively. 
They discuss the spreading of farmers into a region occupied by hunter-gatherers 
by using travelling wave solutions, which they compute numerically. 

A special case of the system (1.1)— (1.3) is the case in which C = H = 0. 
Then F satisfies the classical well known Fisher equation : 



F t = DAF + rpF 



i-f/k], 



(1.4) 
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which has been thoroughly studied by many authors (see [2], for instance). 

A second special case is given by H = 0 and C ^ 0. The equations satisfied 
by F and C have the form 

F t = DAF + r F F 1 - F/K^ - kFC (1.5) 

C t = DAC + r c C [l - C/tf] - akFC, (1.6) 

where a > 0 is a constant (see [3], for instance). The system (1.5)— (1.6), with in 
general different diffusion coefficients, is called a competition-diffusion system and 
the qualitative behavior of solutions has been studied by several authors (see for 
instance [4] and [5]). 

A third special case is the case that H ^ 0 and rp = rc- Then, if we denote 
the density of the total population of farmers F + C by T, we deduce from (1.1)- 
(1.3) the system 

T t = DAT + r T T 1 - T/tf] + eTH (1.7) 

H t = DAH + r u H [l - H/l] - eTH, (1.8) 

where rp = rp — rc- The system (1.7)-(1.8) seems to be similar to (1.5)— (1.6) 
except for the sign of the coupling term. This difference is very important from 
the ecological viewpoint. The system (1.7)— (1.8) is called a predator-prey system 
with logistic growth, i.e. each population can grow in the absence of the other. 

Among the simplified systems, we consider (1.7) -(1.8). The problem which 
we study is 

u t = d u Au + 7 u u(l — u + a u v) in Q x (0, oo), 

v t = d v Av -f 7v^(l — v — a v u) in Q x (0, oo), 

d d 

—u = —v = 0 on dQ x (0, oo), 

ov ov 

u(x, 0) = uo(x), u(x,0) = vo(x) for all x G fl; 
with the hypotheses 
Hd • d u ,d v 0 , 

Ha • 7u ? 7f i Q"ii 5 Q"v ^ 9? 

Hq : is a smooth bounded domain of R N , 

H 0 : The functions uo, Vo ^ C(£l) are non identically equal to zero and nonneg- 
ative. We suppose that 0 < uo,vo < M. 

We prove that as t — ► oo, the solution (u,v) of Problem (P) converges to some 
constant pair (u 00 ,^ 00 ), which depends on the parameters. More precisely, in the 
case that 1 — a v < 0, we have that (u 00 ,^ 00 ) = (1,0), while if 1 — a v >0, then 
(u^yV 00 ) = (u*,v*) where u* and v* are determined by the equalities 

l-u*+ a u v * = 1 - u* - a v u * = 0. 

This indicates that the large time behavior of solutions of (P) is quite simple, 
because it is described by the corresponding ODE system in absence of diffusion. 
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An important problem concerns the transient behavior of solutions. Unfortunately, 
in most cases, we have to resort to numerical methods. In order to perform an 
analytical study, we consider the case in which the conversion coefficients a u and 
a v between u and v are very large. In this case, the condition 1 — a v < 0 holds. 
We prove that the solution pair first becomes close to a pair (u°, 0) where u° is 
the solution of a parabolic problem, before converging to (1,0) as t — > oo. 

In none of the cases, does a free boundary problem occur. However, if we 
relaxed some hypotheses on the coefficients or considered a slightly different scal- 
ing, we would obtain cases where the limit problem is indeed represented as a free 
boundary problem. This will be stressed in the concluding remarks. 



2. The main results 



We study the existence, uniqueness and large time behavior of solutions of Problem 
(P). We prove the following result : 

Theorem 2.1. Problem (P) has a unique classical solution pair (u,v), which sat- 
isfies 

0 < u(x,t) < Mi, 0 < v(x,t) < M 2 for all (x,t) G(lx (0, oo), (2.1) 

where M\ := max(l + a u M 2 ,M) and M 2 := max(l,M). Moreover the solution 
pair ( u,v)(t ) converges to 



(1,0) if 1 — a v < 0, 

(u*,v*) if 1 — a v > 0, 



m 



C(Q) 



as t — > oo, where (u*,v*) is uniquely determined by the equalities 
l-u*+ a u v* = l-v*- a v u * = 0. 



The next result concerns the singular limit when a u and a v tend to infinity. 
More precisely, we investigate the limit of the solutions {(u e ,v e )} of the sequence 
of problems (P e ) 



u t = d u Au + 7 u u(l — u) H — uv 



in Q x (0, oo), 



(n 



v t = d v Av + j v v(l — v) — A -uv 
d d e 

—u = — v = 0 
ov ov 

u(x,0) = Uq(x), v(x, 0) = Vq(x) 



infix (0, oo), 

on dQ x (0, oo), 
for all xGfi, 



with A > 0. Fig. 1 shows the time evolution of (u e ,v e ) when e is very small. It 
appears that v e vanishes due to the strong competition between u e and v e . Then 
u e expands and occupies the whole domain. The dynamics of (u e ,v e ) is described 
by the following theorem : 
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Theorem 2.2. Let u° be the solution of the problem 



(P°) { 



in dQ x (0,T), 



Then for all T > 0 



f u® — d u Au° + 7 u 'u°(l — u°) in Q x (0, T), 

u°(x, 0) = uo(x) T f or a n x g Q. 

A 



i 2 



(u e , v e ) — > (ir,0) in L 2 (D x (0, T)) as e — > 0. 



( 2 . 2 ) 



3. Main idea of the proofs 

In the proof of the stabilization for large time (Theorem 2.1), a major role is played 
by the nonnegative solutions of the system 



J 7uf(l - r T a u s) = 0, 

[ 7^(1 - s - a v r) = 0, 

which are given by (0,0), (1,0), (0, 1) and (u*,v*), where 

% IT a u * 1 a v 

u = and v = . 

1 T a^Oy 1 H - a u ay 

Note that if v* > 0, that is 1 > a v , those four zeros are of interest to us since they 
lie in the set {r > 0, s > 0} whereas the only zeros to be considered are (0,0), 
(1,0) and (0, 1) in the case in which 1 < a v . Note that in the first case is 

the only stable zero of the corresponding ODE system whereas in the second case 
the only stable zero is given by (1,0). 

In the sequel we present the main ideas of the proofs of the results which we 
obtain. We refer the reader to [6] for complete proofs. We prove Theorem 2.1 by 
showing the lemmas 3.1 and 3.2 below. 



Lemma 3.1. Suppose that 1 — a v < 0. If (u,v) is the solution pair of Problem (P) 
then 



(u,v)(t) — ► (1,0) in C(f2) 



1 2 



as t 



00 . 



Sketch of the proof. The idea is to construct suitable upper and lower solutions 
for u and for v. Let u be the solution of the problem 



(F) 



[u t = d u Au + j u u(l - u) 

d 

^ a = 0 

l_u(x,0) = u 0 (x) 



It follows from the comparison principle that 



in D x (0, 00 ), 
on dfl x (0, 00 ), 
for all xgQ. 



u>u in Q x (0, 00 ). 



(3.1) 
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Since the function u(t) tends to the stable steady state 1 as t — > oo, we deduce 
that for all fi > 0 there exists T = T(/jl) > 0 such that 

u(x, t) >1 — fi for all (x, t) G ft x (T, oo). (3.2) 



The inequality 1 — a v <0 together with the comparison principle imply that v < v 
in ft x (T, oo), where v is the unique solution of the problem 



(F') 



f Vt dyAl) T ^yVi^Ojy fi 

d 

—V = 0 
ov 

[v{x,T) = v(x,T) 



v) in fi x (T, oo), 
on dft x (T, oo), 
for all xell. 



Since v(t) a v fi as t oo with fi arbitrary, we deduce that v(t) — > 0 as t — > oo. 
In turn the comparison principle ensures that u < u in fi x (T, oo) for sufficiently 
large T, where u satisfies 



(F") 



Ut = d u Au + 7^(1 + fi — u) in ft x (T, oo), 
d 

< —u = 0 on 9fi x (T, oo), 

ov 

T ) = u(x, T) for all x E ft. 



Therefore u(t) < 1 + fi in ft for t large enough, which, together with (3.2), implies 
that u(t) — > 1 as t — > oo. □ 



Lemma 3.2. Suppose that 1 — a v > 0. If (u, v ) is the solution pair of Problem 
then 



( u,v)(t ) — > (u*,v*) in C(fi) 



as t 



oo. 



(P), 



(3.3) 



Sketch of the proof The starting point is the equality for all 0 < r < T 

F(T)=F(t) + J /(^a^|V U | 2 + ^ 7u a^|V,| 2 ) 

Flulv J J (a v (u* - u) 2 + a u (v* - v) 2 ), 

where T is given by 

F(t) ■= J (^y v a v (u* logw - u) +i u a u (v* logu - vfj(t), 

so that — T is a Lyapunov functional for Problem (P). We set r = 1 and let T — ► oo 
to deduce, also using the boundedness of u and v, that 

L la b ulvClv ^ U * ~ “ v ) 2 ) ^ c i + 1^(1)!, 

for a certain positive constant C \ , which in turn implies that 



(■ U,v)(t ) -> ( u*,v * 



L 2 (Q) 



1 2 



as t — > oo. 



□ 




194 



D. Hilhorst, M. Mimura, and R. Weidenfeld 



Finally, we consider the case in which 1/e multiplies the interaction terms and 
prove Theorem 2.2. 



Sketch of the proof of Theorem 2.2. We first remark that the upper bound for v e , 
namely M2, does not depend on e, but the upper bound for u e does. Multiplying 
the equation for v e by v e and integrating over x ( T > T) give 



=\ [ ( v e nt)+ r [ 7 > e ) 2 (i-n 

Z JQ Jr Jn 



(3.4) 



which in turn implies that 

l/ T ^u e (^) 2 <i|fi|(M 2 ) 2 + (T-r)|fi| 7 ,(M 2 ) 2 . (3.5) 

We deduce from the fact that u e > u, where u is the solution of Problem (F), 
that for all positive times r > 0 there exists a constant p = p(r) > 0 such that 

u e (x,t) > pi for all (x,t) G x (r, oo) and e > 0. 

Therefore, the inequality (3.5) implies that 

[ 

for some positive constant C2. Letting e — > 0 and r — > 0, we deduce that 
v e — > 0 in L 2 (fl x (0, T) as e — > 0. 

It follows from a result due to Simon [7] that the sequence {u e } is precompact in 
L 2 (n x (0, T)). Therefore, there exists a subsequence of {(u e , F)}, which we denote 
again by {(u e ,v € )}, and a function u G L 2 (fl x (0,T)) such that (u e ,v e ) — ► (iqO) 



m 



l 2 



L 2 (Q x (0,T))| as e — > 0. Let cp G C 2,1 (Q x [0,T]) be an arbitrary function 
dip 



such that — = 0 on dfl x [0, T] and <p(T) = 0. Then 



f i [< 



u € + ^v e )<Pt + (d u u € + ^T-v e 



d v 

T 



)A<^j + L (u 0 + ^vo)<^(0) 



TK- 

Letting e — > 0 in equation (3.6) yields 



7 tl u e (l - u e ) + y v £ (l “ v € ))¥ = 0. 



(3.6) 



[ [ wp t + duiiAipl + [ (ti 0 + -LoM 0 ) + [ [ 7 uu(l - u)ip = 0, 

Jo Jo. 1 J Jn A Jo Jn 

which implies that u is the unique solution of Problem (P°) and concludes the 
proof of Theorem 2.2. □ 
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4. Concluding remarks 

As we stated in the introduction, for our scaling limits the limiting problem as 
e — * 0 of Problem (P e ) is not a free boundary problem. However, when d u and d v 
are small and 7 n and j v are large, the large time behavior is the same as in the 
previous case, that is (u,v)(t) — > (1,0) as t — > oo, but the transient behavior is 
totally different. There appears a free boundary between u and v. Fig. 2 shows 
one example. Coming back to the original three component system (1.1)— (1.3) with 
r F zfz rc , a formal computation of the limiting problem as e — ► oo in (1.2) and 
(1.3) gives a free boundary problem. However, these cases are still out of reach 
from the mathematical viewpoint and will be the purpose of future research. 
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The Stochastic Geometry of the 
Crystallization Process of Polymers 

Alessandra Micheletti and Vincenzo Capasso 



Abstract. A Johnson-Mehl tessellation arises as a random division of a given 
bounded region in a d-dimensional Euclidean space, generated by a stochastic 
birth- and- growth process, also known as a germ- grain process in stochastic 
geometry. A typical example is the crystallization of a polymer from an amor- 
phous liquid phase by cooling; in this case a grain (crystal) is formed by growth 
of a germ (nucleus) born at a random time and space location. 

The kinetic parameters , i.e. the nucleation rate and the growth rate, 
may both depend upon space and time. By assuming that at contact points of 
the growth fronts grains stop growing (impingement) , the spatial region will be 
divided into cells, and interfaces (n- facets) at different Hausdorff dimensions 
(cells, faces, edges, vertices) appear. Here we extend previous results regarding 
the evolution of the morphology of the resulting tessellation to the general case 
of space and time heterogeneous birth-and-growth processes. In particular 
evolution equations for the n-facet densities are provided and a technique to 
estimate these densities from digitized images is proposed. 



1. Introduction 

Johnson-Mehl tessellations are generated by birth-and-growth processes, which 
may model many real world phenomena, like crystallization processes, spread of 
fires in the woods, spread of a pollutant in the environment, etc. [8, 11, 14]. 

Such a process may be represented as a spatially marked point process (j> = 
{(Ti, Xi)}^ G N 5 where T* represents the random time of birth of the i-th germ and 
Xi its random spatial location. Once born, each germ generates a grain growing 
at the surface (growth front) with a speed G(x, t) > 0 which may, in general, be 
assumed space- time dependent. 

When two growing cells meet, they stop growing at the contact points, thus 
forming interfaces (n-facets) at various Hausdorff dimensions; this phenomenon is 
called impingement (see figures 1, 2). 

In many applications the characterization of the final morphology of the 
tessellation is of interest, in particular the geometric measures of n-facets. We 
denote by Q l the closed set in H d representing the region occupied by all grains 
by time £, and by 0 * (Xq, To) a grain born at point Xq and time Tq and freely grown 
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Figure 1 . A photograph from a real experiment showing a 
Johnson-Mehl tessellation of polymeric crystals and a simulation 
of a spatially non homogeneous crystallization process 



Tl ME«3 sac Tl ME-S SfiC TIME- 1 0 SBC 




Figure 2 . A simulation of a 2D birth- and-growth process leading 
to a Johnson-Mehl tessellation 



up to time t. In a birth-and-growth process with nucleation events {(Xj,Tj) 0 < 
Ti < T 2 < . . .}, the occupied region at time t is then given by 

e‘= U (i) 

Tj <t 

The set 0* in (1) is also known as germ- grain model or generalised Boolean model 
(see [2] and references therein) . We will assume that the birth-and-growth process 
takes place in a bounded region E C R d , d = 1,2,3, and that G(x,t) > 0 for all 
x £ E,t G R+, so that the time of completion of the tessellation (i.e. the time at 
which all the available space is filled by the growing grains) is a.s. finite. 
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We also assume that the marked point process (MPP) (j> that models the 
birth process of the germs has a stochastic intensity v given by 

v(dx x dt) = a(x, t)( 1 — J©t- ( x))dt dx 

where a, also known as the free space nucleation rate in some applications, is a real 
valued measurable function on Ex R + such that aft, •) G C l (E ), for all t > 0 and 
such that 0 < / Q T dt f E a(x, t)dx < oo for any 0 < T < oo. The term (1 — J@t- (pc)) 
is responsible for the fact that no new nuclei can be born at time t in a zone 
already occupied by a growing cell. We will denote by 

Vo (dx x dt) = a(x,t)dtdx 

the free space intensity. 

We will assume throughout the paper that the kinetic parameters of birth 
( a(x,t )) and of growth (G(x,t)) are given deterministic fields in E x R + . If both 
fields are spatially homogeneous, i.e. they are independent from the spatial coordi- 
nate x, then the birth- and-growth process is spatially stationary . Explicit expres- 
sions for the densities of n- facets in the stationary case have been found in [12] 
(see also [13]); particular cases have also been considered in [9]. A discussion of 
the general case, in which the crystallization process is coupled with temperature, 
so that in particular a and G may depend upon an underlying field (temperature, 
in this case), can be found in [3], and more extensively in [7], Chapter 5. 

2. The Random Tessellation of Space 

At any time t during the crystallization process, the available space (and in 
particular the crystalline phase) is randomly divided into cells due to impingement, 
forming a so called (incomplete) Johnson-Mehl tessellation [8, 11] (the tessellation 
is incomplete at time t if a region with amorphous material is still present). Here we 
will consider random tessellations in R d generated by birth- and-growth processes. 

2.1. Basic Definitions 

Let us denote by Ti(y) the time at which a point y G E is reached by a crystal 
freely grown (disregarding impingement) from a nucleus a* = (Xi,Ti), where Xi is 
the random space location and Ti the random time of birth. We admit Ti(y) = oo, 
which corresponds to the case in which the point y is never covered by the i-th 
crystal. 

Definition 2.1. The crystal Ci(t) of the (incomplete) tessellation, generated by the 
nucleus a* = (Xi,T{) at the time of observation t is the non-empty set 

Ci(t) = {y £ E\ Ti{y) < t and n{y) < Tj(y) Vj ^ i}. 

We will denote by C, (J ) the amorphous region at time t . i.e. 

C e (t): = {y £ E\ n(y)>t, Vi £ N}. 
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d = 1 


d = 2 


d = 3 


n = 0 


vertex=intersection 
of 2 crystals 


vertex=intersection 
of 3 crystals 


vertex^intersection 
of 4 crystals 


n = 1 


crystal 


edge=intersection 
of 2 crystals 


edge = inter sec t ion 
of 3 crystals 


n — 2 




crystal 


face=intersection 
of 2 crystals 


n — 3 






crystal 



Table 1. Specific meaning of “n- facet” in spaces of dimension d 



Let us now introduce a rigorous concept of “interface” : 

Definition 2.2. An n-facet at time t (0 < n < d) is the non-empty intersection 
between m + 1 crystals or between m crystals and the amorphous region, i.e. 

F n (t,k 0 , . . . ,k m ) = c ko {t) nc kl {t) n ■ ■ ■ nc km (t), 

with m = d — n and ko , . . . , k m € N U {e}. 



Note that in the previous definition 

• d— dimension of the space in which the tessellation takes place; 

• n — dimension of the interface under consideration; 

• m + 1 = number of crystals (including C e ) necessary for the formation of 
such an interface, 

(see also Table 1). 

Consider now the set of all n-facets at time t 

^n(^) F' n (t, k(), • • • , &m)> 

feo,-,fcm6NU{e} 



for n < d. 

For any Borel set B one can define the n-facet mean content of B at time t 
as the measure 



Md,n(t,B) 



Md,d(t,B) 



(m + 1)! 



Xni,B f~l F n (t, k 0 , . . . , km ) ) 

feov^m6Nll{e} 



, for n < d 



E 



^ X d {Br\F d (t,k 0 )) 

_fco€N 



( 2 ) 



where A n is the n-dimensional Hausdorff measure and the symbol ^ denotes that 
the indices fco, . . . , k m must be distinct (and thus also the corresponding crystals). 
Note that, with the previous definition, Md,d(t , B) is the d-dimensional volume of 
the portion of the set B occupied by crystals at time t. 
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If *) I'd , where Vd is the d-dimensional Lebesgue measure [4], there 

exists a density /i^ ?n (x,t) such that for all Borel sets B in R d 

M d ,n(t,B)= / n d , n {x,t)dx. (3) 

J B 

Definition 2.3. The function /Jid,n(x,t) defined by (3) is called local mean n-facet 
density of the (incomplete) tessellation at time t . 

In particular pd,d(x,t) and p,d,d- i( x U) are the mean local volume density 
and the mean local surface density, respectively, of the crystallized region at time 
t (elsewhere denoted by Vy(x,t) and Sy(x,t), respectively; see e.g. [14] for the 
nomenclature). The mean local volume density is also the crystallinity. A complete 
characterization of a crystallization pattern at time t is given in terms of the family 
of densities 0 for n — 0, 1, . . . , d. 

Evolution equation for the whole set of spatial densities /x^ n (x, t) of n- facets 
have been recently obtained in [6] in term of the hazard function h (see [3] for 
the definition of h). 



Theorem 2.4. Under sufficient regularity conditions on the kinetic parameters 
a(x,t) and G(x,t) (given deterministic fields), we have 

^d,n(x,t) =C d.n^ 1 ^r( 1 -£(M))[G(x,f)] _m (4) 

where for d — 2,3 

hk(x, t) = [h(x, t)] k , k = 2,3, 

and Cd,n are numerical constants depending only upon the dimensions d and n 

p^ dm+n -\- 1 ^p^ d ^m+1 

Cd * n ~ ( m + i)!r(M»)r(^)"T(^ ) ' 



System (4) is closed by the evolution equation for the hazard function, which 
is given by (see [5]) 

h(x,t) = G(x,t)S ex (x,t). 

The evolution equation for S ex (x,t) is provided in [3]. 

In the spatially homogeneous case the measure Md,n(t, •) is translation- 
invariant, so that 

-M.d,n(t, B) = P'd,n{t)l / d(B ) , 

where Vd(B) is the d-dimensional volume of B. Then, by the use of the hazard 
function h(t), which in this case has an explicit expression in terms of the kinetic 
parameters a(t) and G(t), the following expression for the densities /id,n of an 
(incomplete) Johnson-Mehl tessellation can be derived [9, 12, 11]: 



l^d^nif) — C-d,n ^ $d j G{ll)d‘ 



d - 1 

lu | a(s)ds 



-i m+l 



G(r)p(r)dr (5) 



where d=l,2, 3, n = 0, ...,d,ra = d — n and Sd is the surface of the unit ball 



m 
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3. Estimating the Local Density of Interfaces 

Here the problem of estimating the measure of the boundaries of the cells of a 
2D or a 3D tessellation on digitized images is tackled. Note that, since e.g. a line 
in a 2D digitized image is represented by pixels, which are 2-dimensional objects, 
the task of measuring its length can be difficult. Here a method is proposed to 
estimate the measure of ( d — l)-dimensional objects in a d-dimensional space. 

3.1. Estimate of the Density of d— 1-Facets via the Spherical Contact Distribution 
Function 

For the sake of simplicity, let us denote by 

E(t): = E d -i(t) = (JdC'i(t) 

i 

the random set of the boundaries of the cells of a Johnson-Mehl tessellation at 
time t in H d (here we will only deal with the cases relevant in the applications, 
i.e. d — 2,3). Under sufficient regularity conditions on the growth and nucleation 
rates, this set may be assumed to have Hausdorff dimension d — 1. 

Definition 3.1. The local spherical contact distribution function H s associated with 
a random set E [4] is 

H s fr,x): = P(x G E® 6(0, r)| x £ E), 

where 0 is the Minkowski addition (A ® B = {a + 6| a G A, 6 G J3}, VA, B C H d ), 
and 6(0, r) is a d-dimensional ball of radius r centered at the origin. 

Since our random set Eft) has Hausdorff dimension d— 1, for any fixed x G H d 
we have 

P(oc <£ E(t)) = 1, 

so that the spherical contact distribution function associated with Eft), which will 
be denoted by H s (r,x,t), is 

H s {r, x, t ) : = P(x G S(t) ® 6(0, r)). (6) 

The estimate of fid 4 - 1 is then possible thanks to the following theorem, which has 
been proven in [10]. 

Theorem 3.2. Let Eft) = |J i dCi(t) be the set of the boundaries of the cells of 
an (incomplete) Johnson-Mehl tessellation at time t having Hausdorff dimension 
d — 1. Let E r (t) be its parallel set of radius r, i.e. 

E r (t) = Eft) © 6(0, r). 



If the condition 

Um Vd[Zr(t)nb(x,£)} 

r — »0 



2i/ d -i(dE(t)nb(x,e)) 



r 



(7) 
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is satisfied for all x £ K d and for all £ E R + then the function H s (r,x,t) is 
differentiable at r — 0 for v&- almost all x, and its derivative satisfies 

fH s (r,x,t) | r=0 = 2jU d , d _i(:M). (8) 

It can be proved via the Coarea Formula that Condition (7) is fulfilled by a 
set S C H d if the function 

/: R d -+R+ 

x i — >d(rr,S) 

(where d is the euclidean distance) is Lipschitz [1]. 

Formula (8) may be used to estimate (id,d - 1 from an estimator of H s . Suppose 
to recover a black-and-white image of the boundaries of the simulated crystals, as 
shown in Figure 3. Note from (6) that H s (r,x,t) equals the local volume density 
of the parallel set H r (t) of S(t), which is in general nonzero, being S r (t) a set of 
Hausdorff dimension d . 

An unbiased estimator of H s (r 1 x,t) is then 

1 n 

H s (r,x,t ) = -y^l Hr(t) (xj), 

i = 1 

where X{ are points of a grid overlapped to a suitable window of observation. An 
estimator of fj^H s {r,x,t ) | r=0 ma Y be obtained by numerical approximation, thus 
an estimator of the interface density Hd,d-i(x,t) can be recovered. 

The result of an estimation performed on simulated data is shown in Figure 3. 
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A Posteriori Error Control of 
Free Boundary Problems 

Ricardo H. Nochetto* 



Abstract. A posteriori error estimators are computable quantities depending 
on the discrete solution(s) and the data, which provide upper (and lower) 
bounds for the error; they are thus instrumental for adaptivity. This pa- 
per assesses the derivation of a posteriori error estimators in the context 
of free boundary problems (FBPs). The discussion is based on simple exam- 
ples, from contact problems to degenerate parabolic equations and curvature 
driven flows, and addresses both space and time discretization as well as both 
energy and maximum norm estimates (including interface error estimates). 
Keywords: a posteriori error analysis, contact problems, degenerate parabolic, 
interface error estimates, curvature. 

AMS subject classification: 35K55, 65M12, 65M15, 65M60. 



1. Residuals and A Posteriori Error Estimators 

We consider estimating the error committed in approximating the PDE 

u t + ${u) = f (1) 

solely relying on the data and the computed solution U. This yields the so-called 
a posteriori error estimates and is instrumental in modifying meshes and time 
steps (adaptivity). Adaptive error control is indeed a popular and exciting area of 
current research in science and engineering. The purpose of this paper is to survey 
recent results for (1) within the framework of free boundary problems (FBPs). 

Error and Residual. Let U be the approximate solution, and let 1Z be the residual 

n := -Ut - 3(U) + f, (2) 

that is the amount by which U misses to be a solution of (1). If e := u — U denotes 
the error, then it satisfies the error equation 

et+$(u)-$(U) = K (3) 

Ideally, we would like to derive estimates of the form 

CA\n y <\\e\\x<C 2 \\n\\y (4) 

* Partially supported by NSF Grants DMS-9971450 and DMS-0204670, and NSF-DAAD Grant 
INT-0129243. 
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in suitable functional spaces X and y. The upper bound shows that the quantity 
||7£|fy is reliable, whereas the lower bound indicates that it is efficient; we will 
focus on reliability in this paper. Key issues are to identify suitable pairs (X,y) 
and evaluate the norm || • || y in practice, which usually is a negative Sobolev norm. 

Failure of Duality for FBPs. A popular technique from linear PDE theory to 
establish (4) is duality [10, 11, 12]. If # is Lipschitz, we can rewrite (3) as 

e t T Ae = 7^,, 

where 

A:= f V$(su + (l-s)U)ds. 

Jo 

If we formally multiply by (p G C x (0,T) and integrate by parts over (0,T), we get 

{e,(p t - A* (p) + [ (5) 

Jo 

where A* is the adjoint of A. An error estimate follows by selecting cp in (5) as 
the solution of the backward dual problem 

<p t -A*<p = 0 in (0, T), <p(T) = p, (6) 

and using strong stability properties of (/?, such as a bound for <p t in terms of p, 
for evaluating <p(0) and (7 Z, <p). This program may fail to apply to the present low 
regularity setting because 

• # may not be Lipschitz, and so A and A * may be singular or not even defined; 

• (6) may make no sense or (p t may be unbounded; 

• (p is not computable because it depends on the unknown u\ 

• Replacing A by V$(U) to determine would lead to additional terms involving 
further derivatives of the possibly singular V#. 

Therefore, except in special circumstances such as [5, 24], duality does not quite 
lead to rigorous error control for FBPs. 

Plan of the Paper. We show below several novel techniques for error control ap- 
plicable to FBPs. In §2 we present pointwise error estimates in space via contact 
problems and apply them for error control of free boundaries. In §3 we discuss 
error control for time discretizations using the implicit Euler method for (1) with 

• being either a subdifferential or an angle-bounded operator (Hilbert theory), or 
accretive (Banach theory). We conclude in §4 with some extensions. 



(e(T),<p(T)) = (e( 0)^(0))+ / 

Jo 



2. Error Control in Space: Contact Problems 

2.1. Setting and Challenges 

Let fl be a bounded, polyhedral, not necessarily convex domain in R d with d G 
{1,2,3}. Let / G L°°(fl) be a load function, x £ H 1 (Q) fl C 0,a (fi) be a lower 
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obstacle, with 0 < a < 1, which satisfies x < 0 on dCt. Let JC be the following 
non-empty, closed and convex subset of H 1 (f2): 

JC := {v G H 1 (f2) | v > x a.e. in fl}. 

The variational formulation of the continuous obstacle problem reads as follows: 
u G JC : (Viq V(u — v)) < (/, u — v) \/v G JC. (7) 

Hereafter, ((/?, denotes the scalar product in L 2 (fl) as well as the duality pairing 
between H l (Ct) and 17 -1 (fl) — H we refer to [15, 16, 28]. We are interested 
in the approximation of both the solution u in X := L°°(fl) and the contact set 

A := {u = x} := {x G ft | u(x) = x(^)}, 
or equivalently of the free boundary or interface 

T := > x} fl fl. 

The location of A is encoded in the non-positive functional a G i7 -1 (fl) 

{<T,'p):={f,'p)-(Vu,V<p) VveH'iQ), (8) 

which plays the role of a multiplier for the unilateral constraint. In fact, we have 
a — f + Ay in the interior of the contact set A = {u = x}, where a is typically 
< 0, and a = 0 in the open non-contact set fl \ A = {u > x}- 

Given a shape-regular partition 7^ of fl, the set of nodes of 7^ is denoted by 
J\fhi and the subset of interior nodes by A 4- Let indicate the space of continuous 
piecewise affine finite element functions over 7^ and •= V/* fl H 1 ( fl). The nodal 
basis functions of are given by (4> z ) ze j^ h , and they form a partition of unity of 
fl, that is ^2 z£Af h ( t )z ~ L Let ^ h he the Lagrange interpolation operator onto V^. 
Let Xh ■= I hX he the discrete obstacle. The discrete counterpart JCh of JC is then 

IC h := {vh €%\v h >Xh in fl}. 

Note that it is sufficient to check the unilateral constraint of JCh only at the nodes. 
The set JCh is non-empty, convex, closed but in general not a subset of JC (non- 
conforming approximation). The discrete obstacle problem reads as follows: 

U h G JC h : (Vu h i S/(u h - v h )) < (/, u h - v h ) \/v h G JC h - (9) 

Let lu z = supp ((f) z ) be a generic finite element star and be the union of all 
interior interelement boundaries (sides) in u z . If Jh denotes the jump of the normal 
derivative of Uh across the interior sides, we define the set of full- contact nodes to 
be 

Ch = {z G Uh | U h = Xh and / < 0 in u z , J h < 0 on 7*} 
and denote the discrete full- contact set and its complement by 

n° h = { x € n | Mx) = 1}, nt = nw°- 

z€C h 

Furthermore we set = T fl Q® and = T fl along with 

=w z nn%, 7+ = 7 2 n 



( 10 ) 
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FIGURE 1. Localization for an obstacle with downward cusp and / = — 1. 




FIGURE 2. Localization for an obstacle not resolved on coarse grids. 



To shed some light on the inherent difficulties in estimating ||e|| L oo(^) we 

consider the following Id examples. The estimator should possess the properties: 

• Be insensitive to / within because / should not dictate the mesh quality in 
0% ( localization ). Figure 1 shows this since meshes are rather coarse in 

• Be insensitive to the obstacle within and thus reduce to the usual esti- 
mator for the (unconstrained) Laplacian. This effect is shown in Figure 1: the 
refinement is due to the curvature generated by / = — 1 but not to the cusp 
of x pointing downwards. Overall there is an excellent accuracy for u in the 
non-contact set even though the approximation of x is rather rough. 

• Be able to detect the non-conforming situation x > u h, and refine accordingly. 
This is depicted in Figure 2, where the concave part of % above Uh is detected 
early on and thereby the solution is lifted up. 
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Let II h : Li(fl) — > be the interpolation operator of [4]; see also [27]. Such 

a II h is both positivity preserving and second order accurate, and plays a crucial 
role in defining the discrete counterpart ah of a. Namely, we let ah G i7 _1 (fl) be 
(°h, (p) := EzeM, P0*)> where 



( 11 ) 

Jh (^) 

for all z G and (p G Note that 11^ is evaluated at the node 2 , and is 

thus a constant for each z G A fh • It turns out that 

^ g A fh \ Ch — ^ {&hi t 4 *z) = ( 12 ) 

where s z is a nodal multiplier: 

s z := [ ftz+ [ Jhtz VzeNh. (13) 

Jn Jr 

Moreover, s z satisfies s z < 0 whenever z G Nh U Ch and ah < 0. We finally let 

A h := : z G A U (4 fl dfi) and < 0} (14) 

The discrete multiplier ah satisfies ah < 0, cancels with cr inside the discrete 
full contact set and localizes to the non-contact set Even though < 7 ^ is 
non-computable, we only need the nodal multiplier s z in (14) and Theorem 2.1. 

2.2. Pointwise A Posteriori Error Estimates for Solutions 

Let T) z be the star-based residual indicator 

Vz := h z \ \(f ~ fz) 0z || Looiujf) h z || Jh 0z 1 1 ( 7 +)» (1^) 

where 7 + are defined in ( 10 ) and f z is any constant provided J^+/ 0 Z + 
f r + J h (j) z = 0 and 0 otherwise. The interior residual reduces to oscillation of /. 

Theorem 2.1 (Reliability). Let u be the continuous solution of (7) and let Uh be the 
discrete solution of (9), respectively. Then the following global a posteriori upper 
bound holds: 

\W - w /illLoo(ft) < £/i> (15) 

where the error estimator £h is given by 

£ h := C*|log h m[n \ 2 max ze j\f h rj z (localized residual) 

+ ||(X - m/i) + Hl 00 ( w ,) + II (uh - x) + llL 00 (A h ), (localized obstacle approx.) 

C * is a constant solely depending on the mesh regularity , and Ah is defined in 
(14). 



(<Th, ■= / fV<t>z+ / Jh<P4>z 

M H 

+ / / (lW)(z) 4> z + f 

Jnf Jr 
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The derivation of this optimal bound is done as follows. We first note that 
both a and ah are designed to restore the equality in (7) and (9). We then form 
the so-called Galerkin functional Qh £ 7f _1 (0): 

{Gh, <f) = (V(u - u h ), Vip) + (a- a h , <p) = - (Vu h , V<^) + (/ - a h , <p) . 

This concept, introduced in [32], plays the role of residual in the linear setting. A 
simple manipulation reveals the localization effect built in (11), namely, 

{Gh, v) = [ fW - u h <p] + [ Jh[p- iW]. (17) 

M Jr i 

We next let w € H 1 (12) be the the Riesz representation of Gh, namely —Aw = Qh, 
and use (17) to derive the pointwise estimate 

IM| Loo (n) < C\log h min \ 2 max rj z . 

zeNh 

Such w is employed to correct Uh and thereby construct barrier functions for (7) 
which are of the form Uh + uo ± ||w||o,oo ; fl P^ us additional terms involving Uh — x 
(last two terms of £h)- The continuous maximum principle allows us to compare 
these barriers with u and derive (16). A full proof is given in [26]. We also refer to 
[25, 26] for lower bounds, and to [14] for total variation problems. 

Example: Lipschitz Obstacle. Let := {x £ IR 2 : |x|i < 1}, / := —5, and the 
obstacle be x( x ) := dist(x,c?0) — |. The corresponding multiplier a is a measure 
singular with respect to the Lebesgue measure since it has a mass supported along 
the edges of the pyramid within the contact set. 




FIGURE 3. Discrete solution and obstacle on grids aligned with the 
edges of the obstacle (left) and non-aligned with them (right). 



Figure 3 displays Uh and Xh over two meshes, one aligned and the other 
non-aligned with the edges of x : m the latter case, the representation \h of x 
rather crude. In contrast, the aligned meshes provide excellent approximability of 
X, and the full localization property of £h is clearly reflected in the meshes on the 
left column of Figure 4: the refinement along the diagonals is avoided. This is a 
crucial improvement of the estimator of [26] with respect to the partially localized 
estimator of [25]. The refinement for the non-aligned meshes is instead due to the 
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lack of resolution of y and thus of u (see right column of Figure 4). Note that 
the non-aligned grids do not satisfy the discrete maximum principle, which is not 
required for Theorem 2.1 to hold. These results were obtained with ALBERT [31]. 




FIGURE 4. Grids and level sets d{un = Xh} for adaptive iterations j = 
1, 7, 15: aligned grids (left) and non-aligned grids (right). The benefits of 
full localization are apparent since the refinement across the diagonals 
in contact is completely avoided (left). 



2.3. A Posteriori Error Estimates for Interfaces: Barrier Sets 

Since the estimator Eh controls the pointwise error \\u — w/ 1 ||l 00 (Q )5 we can obtain 
a posteriori information on A and also on the exact free boundary (or interface) 
T. We need the additional nondegeneracy condition: there exists A > 0 such that 

</, <P) - <Vx, Vy>) < -A f <p V(p £ H'iQ),? > 0. ( 18 ) 

Jn 

Condition (18) guarantees stability of the exact free boundary T and is due to 
Caffarelli [2]; see also [15, §2.10]. This condition has been crucial in the a priori 
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error analysis of free boundaries [1, 20] as well. If 

2d / \ 

r l : = y (2£/j + ||(x/j - x) + l|L 00 ({« h < Xh +^}))> (19) 

we then define K := {dist(-, <9f2) > r^} and the barrier sets 

A* := {u h < X + £h}, A* := jdist (•,{«/, > Xh + £&}) > r*}. (20) 




IHBBHBESE 



o’ o it 









FIGURE 5. From balls to bones: Interface barriers for tolerance « 0.01 
(left) and adaptive grids for tolerance « 0.15 (right) in 2d for forcing 
term / = 0,— 5.9,— 8.1,— 15 (from top to bottom). The distance of the 
barriers is « 0.05 for all the four forces. 
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Theorem 2.2 (A posteriori control of contact set and interface). The set A* is an 

upper barrier set for the exact contact set A = {u = x}, i- e - A C A*. 

Moreover, if the stability condition (18) holds, then the set A* is a lower 
barrier set for A in the sense that A* fl K C An K, whence 

TnK c (A* nK)\ int(A* n K). 

A proof is given in [26] . Note that A dictates the thickness of the strips Ct\K 
and (A* D K) \ int(A* D K), and thus acts as a condition number for the free 
boundary. Also, if x € # 2 (^), one can adaptively compute the condition number 
A by A = - sup { Uh <xh+£ h }(f + ^x) [26]. This is used in the following example. 

Example: From Balls to Bones. Let fl = (-2,2) x (—1,1) and the obstacle % 
be x( x ) — 10 — 6(xf — l) 2 — 20(|x| 2 — x\ ), which consists of two hills connected 
by a saddle. Results, reported in Figure 5, were obtained with ALBERT [31]. 
The left column in Figure 5 depicts the interface barriers for four constant loads 
/ = 0,— 5.9,— 8.1,— 15 for about the same tolerance r « 0.01; the exterior curves 
correspond to <9A* whereas the interior curves display SA* . There is a topological 
change, with a double-cusp singularity at the origin, between the extreme values 
/ = 0,-15 corresponding to f cr [ t . The barrier sets A*, A* give a reliable range 
for /crit ; see [26]. Although T develops a singularity, it is worth noticing that 
u € W^(fl) and thus no special refinement is needed to approximate u. Moreover, 
A = 16 in fl for the 4 loads, which shows that the double-cusp is not due to lack 
of stability. The interface estimate of Theorem 2.2 thus applies and provides a 
posteriori error control of the entire free boundary including the double-cusp. 

3. Error Control in Time 

Given the abstract PDE in either a Hilbert or a Banach space 

u t +$(u) = 0, (21) 

we now derive error estimates without regularity assumptions on $ for 

— (U n -U n -i)+3(Un) = 0 . ( 22 ) 

Let U(t) be the piecewise linear interpolant of {U n }^ =0 and set e :=u — U. 

3.1. Hilbert Theory: Subdifferential Operators 

Let TL be a Hilbert space, with scalar product (•, •) and norm | • |, and let : 7-^ — > 
R U {oo} be a proper lower semicontinuous convex function, with domain D ((/>). 
Let # = d(j ) : TL — > 2 n be the subdifferential of </>, with domain D($) C D ((/>). Then 
(21) is equivalent to the variational inequality 

(u t) u - v) 4- 4>{u) - (j>{v) <0 Vv E (23) 

and it satisfies the energy identity 

M*)l 2 + i^( M (0) = ° a.e. t € (0, T). 



(24) 
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Energy Dissipation Estimator. On the other hand, (22) is equivalent to 
-(U n - U n -u Un -v) + 4>(Un) - 4>{v) <0 v»e £(</>), 

Tn 

which, upon taking v = C/ n _i, yields the discrete energy inequality 



Sn 



Un-U, 



n—1 



2 (t>(u n ) - 4 >(u n 



+ 



<0 V 1 < n < AT. 



(25) 



(26) 



The energy dissipation estimator £ n > 0 in (26) measures the deviation of {U n }^ =0 
from satisfaction of the conservation form (24). 



Theorem 3.1. IfU 0 G D{<t>), then max te [ 0jT ] |e(i)| 2 < |e(0)| 2 + Yln=i r 2 £ n . 



Proof. We first write a continuous evolution variational inequality for U 

{u t (t),u(t)-v) + 4>(u(t))-4>(v)<n(t), ( 27 ) 

where 7 Z(t) is a residual term, independent of the test function u, given by 
K(t) := (U'(t), U(t) - U n ) + <t>(U(t)) - <£([/„), G (*„_i,i„). 

We next estimate lZ(t). In view of the elementary identity 

U(t) -U n = ( U n ~ Un-l) = (t~ tn)Ut(t), 

and the convexity of </> 

<KU(t)) < ^ 4>(U n - 1) + 1 ~ tn ~ l 4>{Un ), 

T~n Tn 

(26) implies 7 Z(t) < (t n — t)£ n . Finally, choosing v = U(t) in (23), v = u(t) in 

(27) , and adding the two inequalities, we see that the terms involving </> cancel out 
and we end up with the error inequality 

^(||e(f)| 2 < (t n - t)£ n a.e. t G (t„_i,f„). 

Upon integration in time, this upper bound readily implies the assertion. □ 



Monotone Residual Estimator. The discrete counterpart of the dissipation energy 
inequality j^\u t (t)\ 2 < 0, namely, 

r- 2 |!7 n -t/„_ 1 | 2 <r-_ 2 1 |t/ n _i-17 n _ 2 | 2 Vl<n<JV, (28) 

can be derived as follows. Taking v — U n in (25) at the time-step n — 1, with 
:= U 0 - t 0 $(U 0 ), we find 

T-hiUn-! - Un-2, U n - 1 - U n ) + 4>(U n - 1) - <t>(U n ) < 0- (29) 

Dividing (29) by r n and replacing it into (26), we obtain (28) from the elementary 
inequality 2(v — w,v) > \v\ 2 — \w\ 2 for v,w G H. Recalling (22), we also deduce a 
crucial bound of £ n in terms of a monotone residual estimator V n \ 

£n < Tn l {Wn) ~ 3{U n -l ),U n ~ U n -l) == V n . 

Combining (30) with Theorem 3.1 yields the following error estimate. 



(30) 
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Theorem 3.2. max t€ [ 0iT ] |e(t)| 2 < |e(0)| 2 + Yln=i r n^V 

Rate of Convergence. We now prove that the estimates in Theorems 3.1 and 3.2 
give rise to upper bounds of the error with respect to the largest time-step r (a 
priori bounds). A simple first result is directly suggested by (26), which implies 

0 < T n S n < (f)(U n -i) - <j>(U n ). 

Therefore, we end up with a telescopic sum 

N N 

5>n£» <ry(# n _!) - </>(u n )) = T{<j>(u 0 ) - 4>(U N )). 

71=1 71=1 

Without loss of generality we may assume </> > 0, for otherwise we may add an 
affine function. We then obtain a rate of convergence of order 0(y/r) 

Theorem 3.3. IfUo € D ((/>), then max tG [ 0 ,T] l e W| 2 < |e(0)| 2 + <t>(Uo )r. 

On the other hand, making use of (25) in conjunction with (30), we can 
rewrite the monotone residual estimator V n as follows: 

Vn = (Wn-l)-$(U n ),$(U n )). 

By virtue of the property V n > 0 and the inequality 2 (v — w,v) > \v\ 2 — \w\ 2 , we 
then see that Xl^Li ^ n telescopes 

2^> n 2 P n < T 2 ^ (mt/n-!)! 2 - mUn)\ 2 ) < r 2 mUo) I 2 - 

71=1 71=1 

We then obtain the following linear rate of convergence. 

Theorem 3.4. IfUo G D($), then max tG [ 0 ,T] |e(t)| 2 < |e(0)| 2 + ^|3'(b r o)| 27 ' 2 - 

We point out that no regularity of 4> or # has been used as well as no con- 
straints between consecutive time-steps t n . We stress that the a priori rate of 
convergence depends on the regularity of uo , which dictates the size of |e(0)|. We 
also note that a linear rate of convergence cannot be expected in general for mono- 
tone operators without further structure [29] . We finally refer to [30] for extensions 
to the Wasserstein metric. 

Examples. We consider two prototype parabolic FBPs for which duality does not 
give optimal results. The first one is the parabolic obstacle problem in H = L/ 2 (£l)'- 

u > 0 : ( u t , u — v) + (Vix, V(w — v)) < 0 V v > 0. 

If we denote by Z n := AU n - ^-( U n - U n - 1 ) < 0 the discrete multiplier corre- 
sponding to the unilateral constraint U n > 0, then Theorem 3.2 holds with 

P„=r- 1 ||V(P n -t/ n _ 1 )|| 

L 2 (fi) + T- 1 Un-Un- 1) • 

The second example is a degenerate parabolic problem in H = i7 _1 (f^) which, for 
/ 3 non-decreasing, reads 



u t — A (3(u) = 0. 
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Then Theorem 3.2 holds with 

V n = T- 1 ((3(U n ) - /3(Un- 1 ), u n - U n —\) . 

3.2. Hilbert Theory: Angle-Bounded Operators 

Angled-bounded operators form a class of monotone operators larger than subdif- 
ferential for which optimal error control is possible [23] . They satisfy for 7 > 0 

(3(v) - S{w), W - z) <7 (S(v) - S(z), V - z) W, to, z e DCS). 

Theorem 3.5. The results of Theorems 3.2 and 3.4 are valid for # angle-bounded. 



Example. We consider the convection-diffusion equation in TL = L 2 (Q): 

$(v) = —v 2 Av + b • Vv + cv. 

If \b(x)\ < bo and d(x) = — ^divb(x) + c(x) > dg, then we prove in [23] that 



7 



b 2 o 



v 2 (dl + v 2 pl) 



)■ 



where po is the Poincare constant for zero Dirichlet boundary values, along with 
the optimal a posteriori error estimate 

ll W — ^Ilioc^O^L^p ^ 2 ||V(^ - U"j || (o ; T ;L 2 (S"2) - 11^0 “ ^o||i 2 (n) 

N r 

+ 2 7 2 E r n(/ AMU n - ^n-l||i 2 (f 2 ) + d(x)\\U n — t/ n - 1 ||i 2 ( n ) )- 

^ i J O 



n—l 



3.3. Banach Theory: Accretive Operators 

We consider now a Banach space B with norm || • || and dual B* . Let (•, •) be the 
duality pairing between B and £P, and let J : B — * 2 s * be the subdifferential 
of the norm, which satisfies \\v\\ = (v,J(v)) for all v e B. Let # : B — ► 2 s be a 
(multivalued) accretive operator, namely, 

(S(u)-S(v),J(u-v))> 0 Vu,veD{S). (31) 

If U(t ) = U n for f n _i < t < t n is a piecewise constant interpolant of {£/„}() L 0 , 
then (22) also reads 



U t (t)+$(U(t)) = 0 a.e. in (0,T). (32) 

If, to derive an error bound, we naively take the difference of (21) and (32), 
and multiply it by J(u(t) — U(t)) to exploit (31), we realize that (u t — U t ,J(u — U)) 
is no longer the derivative of \\u — U || as in §3.1. The difficulty of dealing with 
two strong nonlinearities, $ and J, arises and prevents us from deriving error 
bounds. We are thus forced to deal with the continuous and the discrete equations 
separately; below are the crucial steps of our argument. We refer to [21] for details. 
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Evolution Equation for the Continuous Solution. Since J is the subdifferential of 
the norm, (21) is in fact equivalent of the evolution equation 

^||ii(«)-«|| = -(3 r («(t)),J(u(t)-«)> VveD($), (33) 

which we will try to reproduce in the discrete setting, starting from (32). 

Evolution Inequalities for the Discrete Solutions. The map s i— > \\U(s) — V\\ is 
convex on each interval (£ n _i, t n \ and therefore its derivative is not decreasing and 
is bounded by its value at s = t n . Then, for all V E B and a.e. t E (0,T) 

±\\U(s) - V\\ = (U,(s), J(U(s) - V )) < (U s (s), J(U(s) - V )). (34) 

Combining (32) with (34), for all V E B and a.e. s E (0,T), we obtain 

l||c/( s) _ v\i < -($(U(s)),J(U(s) - V)). (35) 

Doubling Variables. To combine (33) with (35) via (31), we would like to choose 
v := U and V := u and add the equations. However, it is impossible to write 
(33) and (35) at the same time “t = s” , since we have derived the equations with 
respect to time-independent test “elements” v,V. Therefore we keep two distinct 
“time” variables t,s and choose v := I7(s), V := u(t), thus obtaining from (31) 

d s \\ U(s) - u(t ) || + d t \\U(s) - u(t) || < 0. (36) 



Penalization. To prevent t and s from getting too far apart, we penalize their 
distance with a parameter e E (0,T), and thus integrate (36) in the 2d strip 

S^t : = {(s,t) e M 2 : 0 < t < T, t-e<s<t}. (37) 

In order to deal with negative values of s, we extend U, U in (— e, 0) by 

U(s) := U 0 , U(s) := U 0 - s$(U 0 ) (38) 

so that (32) still holds. Applying the Gauss-Green formula in Sq T gives 

[ T ||C7(s) -«(T)||ds < £||u 0 - U 0 \\ 

Jt-e 

+ £ (\\m - u(t) II - II U(t) - «(0||) dt (39) 

+ £ (|| U(t -e)- u(t ) || - || U(t - s) - u(t)||) dt. 

Stability. We take the difference between two consecutive equations (22), and 
multiply by J(U n — U n - 1 ), to arrive at 

T~ l {U n ~ U n -i,J(U n ~ Un-!)) + (3(U n ) ~ Wn- 1), J(U n ~ U n - 1)) 

= T-\{U n - 1 - U n -2, J(Un - f/n-l))- 
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Making use of (31), together with property ( v,j(w )) < \\v\\ for all v G B with 
equality for w = v, we get 

T^WUn - t/„-i II < T-^WUn^ - U n - 2 \\ V 1 < n < N. 

If we now choose U-\ = Uo — tq$(Uo) with r 0 < £, which is consistent with (38), 
recursion yields the following discrete stability estimate: 

sup \\U t (t)\\= sup — \\U n - U n -1 II < ||3 r (t/o)||. (40) 

te(0,T) l<n<N T n 

A Posteriori Error Estimate. Since 

\\U(t) - u(i)|| - || U(t) - u(t ) II < \\U(t) - U(t) II, 

\\U(t-s) -u(t)\\ - \\U{t-£) -u(t)\\ < \\U(t-e) - U(t-e)\\, 

and f® e || U(t) — U(t)\\dt = §||3 r (C/o)||i a simple manipulation of (39) implies 

- [ T \\U(s)-u(T)\\ds<- [ T (||f/( S )-u(T)|| + ||C/( S )-C/( S )||)d S 
e Jt-s e JT-e 

< IK - U 0 \\ + |||5(f/o)|| + - £ J* ll^(t) - U(t) II dt. 

Since / Q T ||t/(f)-t/(f)|| dt = \ J2n=i T n\\U n -U n -i\\, and (40) leads to 4 J^_ e ||C/(s) 
-U(T)\\ ds < |||5'(f/())||, we readily infer that 

i N 

I HT) - U(T ) || < |K - U 0 \\ +e||3Wo)|| + - ^r n ||C/ n - 

^ n= 1 

Upon optimizing £, we conclude the desired a posteriori error bound of [21]. 
Theorem 3.6. IfUo G D($), then 

||e(T)|| < ||e(0)||+2||^o)H 1/2 (E r «ll [/ «- C/ -ill) 1/2 - 

n= 1 

The idea of doubling the variables and penalizing goes back to Kruzkov for 
first order conservation laws [17]. 

A Priori Error Estimate. To show that the above error bound exhibits the correct 
asymptotic order of convergence, we resort to (40) to deduce 

N 

'Y^r n \\U n - U n -i\\ < TT||3 r ([/ 0 )||. 

n= 1 

This yields the following celebrated a priori estimate of Crandall and Liggett 
[ 7], originally derived for constant time steps by a different technique: 

||e(T)||<||e(0)||+2||^(C/o)||^T. (41) 



Example. Consider a degenerate parabolic PDE u t — A (3(u) = 0 in B = Li(fi). 
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4. Extensions 

We finally discuss briefly two extensions. We refer to [18, 19] for details. 

4.1. Mean Curvature Flow of Graphs 

Let a surface T(t) be described by the graph u : Q x [0, T] — > E. Let 

Qu := (l + |Vu| 2 )^ 2 , Nu := (Vu, -1 )/Qu, Vu := d t u/Qu, 

be the elementary area, the unit normal vector and the scalar normal velocity of 
r (t). We assume that T(t) evolves according to the geometric law 




Note the presence of Qu in the first term, which makes the study of this equation 
much harder than (21). Let U : [0,T] — > denote the semidiscrete finite element 
approximation 

A key step in the analysis is to identify the correct geometric quantities to estimate. 
This was done by Deckelnick and Dziuk for the a priori analysis [8], and boils 
down to examining 

E(t):= [ \Nu — NU\ 2 Qu + [ [ \ Vu - VU\ 2 Qu. 

Jn Jo Jn 

Theorem 4.1 (Conditional Estimate). There exist two computable estimators (t) 
and £ 2 (t), and a constant Co such that if £oo(t) < | then E(t) < CoE 2 {t). 

4.2. Allen-Cahn Equation: Circumventing Gronwall’s Inequahty 

We finally consider the singularly perturbed parabolic Cauchy problem 

£ (d t u, v) + £ (Vu, Vu) + - (/(u), v) = 0, u(0) = u 0 , 

where f(u) := u(u 2 — 1), and its finite element discretization: find U n E V n 

— (Un - InUn- 1, V) + £ (VU n , VV) + - (f{I n U n - 1), V) 

T n £ 

+ \ (f'(InU n -l)(U n - InUn-l), V) = 0 V V 6 V„. 

Let U(t) be the piecewise linear interpolant of {U n }^ =0 and 7 Z(t) be the residual 

(K(t), v) := £ (8 t U, v)+e (VU, Vw) + 1 (f(U), v) Vv € H\n). 

We assume that the initial condition uo is such that interfaces are already devel- 
oped. Then we have the following spectral estimate [3, 9]: 

e ll Vw lli 2 (n) + ^(f(u)v, v) > -A 0 £|M|| 2(n) , (42) 
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Theorem 4.2 (Polynomial Error Control). Let 6 > 0 be a tolerance, and let £q,£\ be 
parabolic error estimators based on 7 Z(t). If (42) is valid, then there exist constants 
(Ai)i= o> depending exponentially on Ao T but independent of e, such that if 

9 < Ao£ 4 , Il e (0)||i, 2 (ft) < Ai 6, £q < A 2 e6, £\ < A 2 e 2 9, 

then 

l|e||L oo (0 > T;L 2 (n) ^ 

We observe that the dependence on £ _1 is polynomial and not exponential, 
as it could be expected from a naive application of Gronwall’s inequality. This 
result is inspired in the a priori analysis of Feng and Prohl [13]. 
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The Total Variation Flow 

Matteo Novaga 



Abstract. We consider the gradient flow of the total variation functional, stat- 
ing general existence and uniqueness results. Particular attention is paid to 
self-similar solutions, which are partially classified. Finally, as an application, 
some explicit solutions of the denoising problem are given. 



1. Introduction 

The aim of this paper is to analyze the following variational parabolic equation 

Ut ~ div = °’ u e jLll ° c ^ 0, +0 °[ xRjV )’ 6) 
coupled with the initial condition 

u(0,i) = uo(i) € (2) 

some results on the problem (l)-(2) have been proved in [5], [6]. 

This equation corresponds to the L 2 -gradient flow for the total variation 
functional 




J rn 



starting from u$. Such a flow has many applications to image denoising and recon- 
struction, and numerical simulations have been performed by several authors [13], 
[10], [9], [16], [15]. Existence and uniqueness for solutions of (1) has first been es- 
tablished in the case of bounded domains (see [4] and references therein) and then 
extended to R N in [5] (see Theorem 2.4 below). The notion of solution employed 
in these papers is the so-called entropy solution , introduced by Kruzhkov [12] for 
scalar conservation laws, and first applied to parabolic equations in divergence 
form by Andreu et al. [2], in order to prove uniqueness with initial data in L\ oc . 

It has been proved in [3] that the solution of (1) reaches its asymptotic state in 
finite time, with an extinction profile which solves (up to a rescaling) the following 
eigenvalue problem 

-diV (|^[) = UE ( 3 ) 
In [6] (see also Sections 3 and 4) the solutions of (3) are partially characterized in 
the case N = 2. Particular relevance is given to solutions which are in ^^(M 2 ) 
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(see Proposition 3.4), or which are a finite sum of characteristic functions (see 
Theorems 4.1, 4.6 and 4.7). 

The solutions u which have no jumps, i.e. (-M)VwAM G VP^’^IR^) for any 
M > 0, can be constructed starting from the “fundamental solutions” u(x) = ±j^y 
(Proposition 3.4), and they can be obtained as limits of “tower solutions”. 

If a solution u of (3) is positive, then all the connected components of the 
sets {u > t} are convex (see Proposition 3.1), and one can construct solutions 
which look like a tower, i.e. u = a iXBi , for any choice of balls Bi such that 
B\ D B 2 D . . . D B p and for suitable constants > 0 (see Section 5). More 
generally, at least in two dimensions, one can construct more complicated towers 
(which can also oscillate) starting from sets which are not necessarily circles, but 
which must satisfy a suitable condition on the curvature of the boundary and on 
the mutual distance (see Theorem 4.6 and Section 5). These towers correspond to 
solutions of (3) which change sign, and in turn provide new solutions of problem 
(4) below. 

Indeed, a related question is to find solutions of the so-called denoising prob- 
lem, i.e. 



mm / 

ueBV(RZ) J R 2 



and, in particular, to determine functions / G L 2 (R 2 ) for which the solution u of 
(4) is constructed by the soft-thresholding rule. In Propositions 6.1 and 6.2 below, 
it is shown how to construct solutions of problem (4), starting from solutions of 
(3). 



2. Existence and uniqueness of solutions 

In the sequel, we will denote by L^(]0, T[; BV(R N )) the space of functions w : 
[0,T] — ► BV(R N ) such that w G iAQO, TfxR^), the maps t — > f RN 4>dDw(t) are 

measurable for any (f) G Cq(R n ] R n ) and J Q T \Dw(t)\(R N )dt < +oo. We will denote 
by Z4(]0,T[; BV\ oc (R N )) the space of functions w : [0,T] — > BV\ oc (R N ) such that 
wcj) G Z4(]0,T[; BV(R n )) for any 0 G Cg°{R N ). 

Definition 2.1. A function u G C([0, T]; L 2 (R N )) is called a strong solution of (1) 
if 

u G W^(0,T-L 2 (R N ))nLl(]0,T[-BV(R N )) 
and there exists z G L°° (]0, r[xE A ' ; E /V ) with ||-3|| oo < 1 such that 

u t =divz in V' (]0, TfxE^) 

and 

[ {u(t)-w)u t {t)= [ (z(t),Dw)— f \Du(t)\ Vw G L 2 (W N )nBV(R N ), (5) 

Jr n Jr n Jr n 

for a.e. t G [0,T]. 
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Notice that a solution of (1) of class C 1 and such that Du ^ 0 is also a strong 
solution, setting z := . 

We recall from [5, Theorem 2] the following result. 

Theorem 2.2. Let uo E L 2 (W N ). Then there exists a unique strong solution u of 
(1) and (2) in [0,T] x ~R N for every T > 0. Moreover, if u and v are the strong 
solutions of (1) corresponding to the initial conditions E L 2 {R. n ), then 

|| (u{t) - v(t)) V 0|| 2 < ||(^o - ^o) V 0|| 2 for any t > 0. (6) 

Since (1) is the gradient flow of a convex functional on L 2 (R N ), which is an 
Hilbert space, Theorem 2.2 can be proved by means of classical techniques, up to 
minor modifications (see [7] for a general approach to these problems). 

Notice that, putting w = u(t) in (5), it follows 

[ (z,Du) = f \Du\ . (7) 

Jr n Jr n 

Let us give a heuristic explanation of what the vector field z represents. Given a 
set E C R n of finite perimeter, denote by d*E the reduced boundary of E in the 
sense of De Giorgi (see [1]). Condition (7) essentially means that z has unit norm 
and is orthogonal to the level sets of u. In some sense, z is invariant under local 
contrast changes. To be more precise, observe that if u — c iXE { where Ei 

are sets of finite perimeter with disjoint closure, q E R and 

■ div (i^i ) = f e l2<r " ) ' (8 > 

then also — div(|^y) = / for any v = Y^a=i diXEi where di E R and sign(d^) = 
sign(c^). In an informal way, this means that a local contrast change of a solution 
of (8) produces a new solution. 

2.1. Entropy solutions 

In order to generalize the previous result by allowing initial data which are only 
in L 1 1 oc (R iV ), let us introduce the set V C VF 1,00 (M) as follows 

V := {p E W 1,00 (IR) : p' > 0, spt(p') compact}. 

Definition 2.3. A function u E C([0, T]; Ll oc (R N )) is called an entropy solution of 
(1) and (2) if u(t) converges to uq in L 1 1 oc (R iV ) as t — ► 0 + , 

p(«)eLi ) (]0,T[;BVioc(R JV )) VpeP, 

and there exists z E L°° (]0, R N } with H^Hoo < 1 such that 

u t — div z mV (]0 ,T[xR n ) (9) 

and 

-[ [ j(u-l)Vt+ [ [ rjd\D(p(u-l)) |+ [ ( 

Jo Jr n Jo Jr n Jo Jr n 



Z’Vpp(u — l) < 0 ( 10 ) 
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for all l G R, all rj G C°° QOjTfxR^), with 77 > 0, rj(t,x) = </>(t)ip(x), being 
<t> G Cq 0 (]0,T[), if G Cq^R^), and for all p G V, where j(r) := f p(s) ds. 

Jo 



Inequality (10) is a weak form of equality (5); indeed, substituting u t with 
divz in (5) and integrating by parts, using also H^Hoo < 1 and the fact that 77 is 
nonnegative, one gets 




z • V 77 p(u 



0 



/ j{u-l)tV- / 

Jm. N Jr n 



r] d(z, D (p(u — /))) 



> -/ j( u ~ l)tV ~ [ V d\(z,D{p(u-l))\, 

Jr n Jr n 

which, after integration in time, gives the opposite inequality in ( 10 ). 

From [5, Theorem 3, Proposition 4] one gets the following results. 



Theorem 2.4. Let uq G L 1 1 oc (R iV ). Then there exists a unique entropy solution of 
(1) and (2) in [0,T] x R N for all T > 0. Moreover, if uo,uok G Ll oc (W N ) are such 
that uok — ► Uo in L 1 1 oc (M Ar ) and u,Uk denote the corresponding entropy solutions, 
then Uk — > u in C([0, T]; Ll oc (R N )) as k — >• + 00 . 



Theorem 2.5. Let u 0 e LL(^ N ) and u be the entropy solution of (1) and (2). 
Then 



p(u)t G Lf oc ( 0 , 00; Li oc (R n )), t 2 p(u) t G Lf oc ([ 0 , 00 [; Lf oc (R w )), Vp G V. 



Moreover, if uo > —M for some M > 0, it follows 

u(t) + M 



u'(t) < 



t 



for a.e. t > 0 . 



Finally, 



Ut e L\ oc (]0, T[; iio C (K w )) 



for any T > 0. A similar statement holds if uq < M for some M > 0. 



Remark 2.6. If uq E L 2 (M Ar ), then the strong solution of (1) and (2) coincides 
with the (unique) entropy solution. 



3. Self-similar solutions 

The aim of this section is to characterize the solutions of (3), when N = 2, i.e. 
to characterize the solutions of equation ( 1 ) which are self-similar. Therefore, we 
shall consider to the following eigenvalue problem 

~ div =% UE ^“(K 2 )- 

An entropy solution of (11) is such that p(u) e BV\ oc (R 2 ), for any p e V. In 
particular, it follows that u G GBVi oc (M. 2 ). 

We recall from [ 6 ] the following regularity result. 
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Proposition 3.1. Let u be a solution of (11). Assume that u E Lf oc (£l) for some 
p> 2 and for some open set fl C R 2 . The following assertions hold. 

(a) For any t E R the sets {u > t) and {u > t} have a boundary of class C 1,a 
in f l , for some a E ]0, 1[ (C 1,1 when u E L^ c ( fl) ). 

(b) Ifu > a in fl (resp. u < a in fl) for some a E R then, for any t E R, 
curvatures k of d{u > t} T I f2 and h of d{u > t} fl fl satisfy k > a and 
h> a (resp. k < a and h < a) in the sense of distributions. 



In the sequel, given a function u as in Proposition 3.1 and t E R, we shall 
always identify the set {u > t} (resp. {u < t}) with its points of density one, which 
is an open set, and accordingly define {u > t} as the complement of {u <t}. 

Proposition 3.2. Let u be a solution of (11). Assume that u E fl 

for some open set C R 2 . Then for any t E R, t ^ 0, every connected component 

of d{u > t} D fl is contained in the boundary of a ball of radius l/\t\. 

Proof. Let t E R, t ± 0, let e > 0 and let fl e be the open set defined as fl e := {x E 
fl : | u(x) -t\ < e}. Let also 7 := d{u > t}nfl. Since u E L^ c (fl), by Proposition 
3.1 the curve 7 and the two curves 77 := d{u > t — e} D fl, 7^ := 9{u < £ + e} fl fl 
are of class C 1,1 . Moreover, since u E the two sets £7 := % fl 7 and 

X)+ := 7+ 07 are closed sets of zero 7f 1 -measure. Then the curve 7 \ (£7 U £^ ) 
is contained in Q e for any e > 0. Since 7 is of class C 1,1 , by (b) of Proposition 
3.1 it follows that the curvature of 7 belongs to (t — e,t + e) for any e > 0. The 
conclusion follows by letting e — > 0 + . □ 



Note that if u is as in Proposition 3.2 then the set {u > t] is a disjoint union 
of circles of radius jlj-, for any t ^ 0 such that the boundary of {u > t} is entirely 
contained in fl. 

The following result shows that there are no nontrivial solutions of (11) which 
are locally bounded and have no jumps. 



Lemma 3.3. Let u E ^^(R 2 ) H L^ C (R 2 ) be a solution of ( 11). Then u = 0. 



Proof. Assume by contradiction that sup u > 0 (the case inf u < 0 can be treated 
in a similar way). From Proposition 3.2 it follows that the set {u > t} contains 
an open ball B t of radius j, for any t E (0,sup u). Fix t E (O^up-u) and let 
t* := sup Bt u > t. Then a connected component of the set {u = t*} is a closed ball 
D t * C B t of radius From (11) one gets 



,. = 22 / uJx= <Q 1 f 

n JD t * n JD t * 



u dx — — 



(* 



*\2 



7 r 



/ 

JD t * 



divz dx = 2 1* , 



which is a contradiction. 



□ 



Hereafter, we shall consider solutions u of (11) which satisfy the following 
property 

V£ E R there exists an open set U t D d{u > t} such that u E {Ut). (12) 
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Observe that, if u satisfies (12), thanks to Proposition 3.1 all the level sets of u 
have boundary of class C 1,1 . 

The following proposition characterizes the solutions of (11) with have no 
jumps (see [6, Section 4]). 

Proposition 3.4. Assume that a solution u o/(ll) satisfies (12) and (—M)\/uAM G 
^(R 2 ) for any M > 0. Then one of the following possibilities holds: 

(i) u = 0; 

(ii) u is positive and the set {u > t} is a ball of radius for any t > 0; 

(iii) u is negative and the set {u < t} is a ball of radius — for any t < 0; 

(iv) u is nonnegative, {u > 0} is a halfspace and the set {u > t} is a ball of 
radius \, for any t > 0; 

(v) u is nonpositive, {u < 0} is a halfspace and the set { u < t) is a ball of 
radius for any t < 0; 

(vi) both {u > 0} and {u < 0} are halfspaces, the set {u > t} is a ball of radius 

\, for any t > 0, and the set {u < r} is a ball of radius for any 

T < 0. 

Proposition 3.4 essentially means that the solutions of (11) with no jumps can 
be obtained from the fundamental solutions u(x) = d= j^, by a suitable translation 
of the level-sets {u > t} (or {u < t}). 

4. Properties of the level-sets 

In this section we shall consider “tower solutions” of (11), i.e. solutions of the type 
u = i c iXEi for a suitable choice of sets Ei C M 2 and constants c* G M. 

Let us denote in the sequel by Co, C \, . . . C m ra+ 1 open sets with boundary 
of class C 1 ’ 1 with the following properties: 

Ci C Co for any i G {1, . . . , m}; 

Ci nCj = 0 for any i, j G {1, . . . , m}. 

Choosing k G {0, . . . , m}, let us define 

m m 

F-CoXUQ, F:=FU U Cj. 

i = 1 j=k + 1 

Let also v F be the exterior unit normal to dF. 

We want to identify necessary and sufficient conditions on F in order to have 
F = {u = t} and UJlfe+i Cj = {u> t}, for some t G R and for some tower solution 
u of (11). Using (9), this implies that there exists a vector field z G L°°(F, IR 2 ) 
such that 

— div z — t on F, 

. IMIoo < 1, 

[z, v F ] = -1 W-Si.e. on dCi, i G {1, . . . k}, 

[ 2 ,i/ f ] = 1 W 1 -a.e. on dCj, j G {k + 1, . . . m}, 



( 13 ) 
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where [z, u F ] denotes the trace of the scalar product between z and v F (see [5] for 
the precise definition and properties). 

Notice that a problem closely related to (13) has already been considered in 
the literature related to capillarity problems, see [11], [8]. 

On the other hand, given a function u = YldLi UXf^ where the sets F{ C M 2 
are disjoint, with boundary of class C 1,1 and such that IJ^i F 1 * = R 2 , then u is a 
solution of (11) if and only if there exist vector fields Zi G L°°(i^,R 2 ) satisfying 
(13) for any i G {1, . . . , M} and such that [z, v F ) + [z, v F \ = 0 on dFi D dFj. 

It follows that the characterization of the tower solutions of (11) reduces to 
solve problem (13). 



4.1. Bounded domains 

Let us first consider the case in which the set F is bounded. Set 



Jo := 



£L 0 m)-E7= fc+ im) 






F f (E) := P(E,F) + J2\d*EndCi\- £ \d*E n dCj\ - j 0 \E\. 

j = fc + l 



i=0 



By the Gauss-Green Formula, problem (13) reduces to 



— di v z 


= jo on F, 


ll^lloo 


< 1, 


[z,u F ] 


= — 1 on dCi,i £ (0, . . . k}, 


[ Z y] 


— 1 on dCj ,j € {k + 1, . . . m} 



(14) 



Finally, we denote by A the family of all sets E C F such that dEnint(F) consists 
of pieces of circumferences of radius 1/jo, which meet dF tangentially and which 
span an angle less than or equal to 7 r. We also require the contact angle to be 7 r 
on Uto and zero on Uj=/c+i and we impose that {0, F} G A. 

The following result is proved in [6]. 



Theorem 4.1. The following conditions are equivalent. 

(a) There exists a vector field z : F — > IR 2 satisfying (14). 

(b) There holds 



jo / w < / | Dw\ 
J F J F 



K p 

+E / w - 

i = 0 J 8Ci 



m r 

E / - 

JdC< 



Vw G BV(F). 



j=k + 1 



(c) For any E C F of finite perimeter, there holds 



F f (E) > 0. 

(d) There holds 

min Ff(E) = 0. 
EeA v ' 



(15) 



(16) 



( 17 ) 
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Remark 4.2. Notice that, when k — 0, jo tends to zero as Cq tends to M 2 ; in 
this case, the minimization problem (17) reduces to the problem considered in [5, 
Theorem 6]. 

Notice that, in most of the cases, the family A is a finite set, hence condition 
(d) is often easy to verify. However, Theorem 4.6 below will provide more explicit 
conditions, involving the curvature of dF and the distance between the sets C{. 
Let us start with the definition of the so-called ball condition [8]. 

Definition 4.3. Let £1 C R 2 be an open set with boundary of class C 1,1 , and let 
p > 0. We say that £1 satisfies the p-ball condition if a ball of radius p can be 
rotated along dQ in the interior of ft such that no antipods of the ball lie on <9£1. 

We recall the following result [8, Theorem 4.1]. 

Lemma 4.4. Let £1 C R 2 be an open set satisfying the p-ball condition, for some 
p > 0. Then sup^ ft < K Moreover, given a ball Bcflo/ radius p and tangent 
to dft, the set T fl dB is connected for any connected component T of dft, and it 
spans an angle less than i r. 

Observe that, in general, the inequality sup^ ft < - p does not imply the p- 
ball condition for the set £1. Let us also point out that if £1 is a convex set with 
boundary of class C 1,1 such that sup^ ft < then £1 satisfies the p-ball condition. 

Let F be as in Theorem 4.1. In the sequel, as soon as we say that F or F 
satisfies the ball condition, we shall always mean the i-ball condition. 

Remark 4.5. If Ci is convex for any i E {0, . . . m}, s\xp dCo ft < jo and 

dist (8Ci,dCj) > V(i,j) e {0, . . . k}, j, 

Jo 

then F satisfies the ball condition. 

The following result, proved in [6], provides a necessary condition and a 
sufficient condition for the existence of a vector field 2 : F — » R 2 satisfying (14). 

Theorem 4.6. Let F be as above and denote by k the curvature of dF. Assume 
that there exists a vector field z : F — > M 2 satisfying (14). Then 

sup ft < jo < inf (—ft), i G {0, . . . , k}, j G {k + 1, . . . , m}. (18) 

dCi dc 3 

Conversely, assume that F satisfies the ball condition, that the inequalities at the 
right hand side of (18) hold, and 

dist (8Ci,dCj) > V(i, j) E {0, . . . ,k} 2 U {k + 1, . . . ,m} 2 ,i A T (19) 

.70 

Then there exists a vector field z : F — > R 2 satisfying (14). 
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4.2. Unbounded domains 

Let us now consider the case Co = R 2 and C \, . . . , C m as above, which means the 
F is unbounded but R 2 \ F is bounded. In this case, problem (13) reduces to the 
existence of a vector field z G L°°(F, R 2 ) such that 

— div z — 0 on F, 

IMloo < T /Qrp 

[z,u F ] = — 1 W 1 -a.e. on dCi, i G {1, . . . k}, ' 

[z, v F ] = 1 H 1 - a.e. on dCj,j G {k + 1, . . . m}. 



There holds a result analogous to Theorem 4.1. 



Theorem 4.7. The following conditions are equivalent 

(i) Problem (20) has a solution. 

(ii) There holds 



0< 



/ ^ + tj 

JF i=1 JdCi 




W 



\/w G BV{F). 



(iii) For any X C F of finite perimeter, there holds 

m k 

P(X,F)> Y, 'H 1 (d* x n c j )-J2 nl ( d * XnC i) 

j=k+l i=l 

(iv) let E\ be a solution of the variational problem 

{ m k I 

P(E) : (J 

j=k+l i = 1 J 

then there holds 

m 

P(Ei)= Y 

j=k+l 

let E 2 be a solution of the variational problem 

[ k ml 

min i P(E) : |J Q C E C K 2 \ (J Cj\ , 



i = 1 



j=k + 1 



then there holds 



P(E 2 ) = YP( Ci). 



i—1 



(21) 

( 22 ) 

(23) 

(24) 

(25) 

(26) 



Notice that (iv) implies that each Ci is a convex set. Moreover, since any 
minimizer of problems (23) and (25) has a boundary (lying inside F) made of a 
finite number of segments which intersect tangentially dF (and there are only a 
finite number of such segments), the number of such minimizers is finite. Finally, 
conditions (24) and (26) are essentially distance conditions between sets Ci of 
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r L 

Figure 1 . A bean-shaped admissible set 
the same type, for example they are satisfied if dist (dCi.dCj) > P(Ci ), for any 



5. Some examples 



In order to clarify the conditions given in Section 4, we shall discuss some explicit 
examples. 



Example 1 . Let = x -r+ y 2 ; then u is a solution of (11) with no jumps, 

satisfying condition (yi) of Proposition (3.4). Notice that u is not continuous at 

(0,0). 

Example 2. Let F C M 2 be the set in Figure 1. It is easy to check that F satisfies 
the assumptions of Theorem 4.6, since it is a convex set with C 1,1 boundary and 
there holds 



1 2tt r + 2 L P(F) 

fr = r < ^*L=-m- 



(27) 



Moreover, since the inequality in (27) is strict, the solution of (1) starting from 
Xf' remains a characteristic function for any convex set F' of class C 1,1 close 
enough to F in the C^-norm. 



Example 3. Let F C M 2 be the union of two disjoint balls of radius r whose centers 
are at distance L. In this case, conditions (24), (26) of Theorem 4.7 become 



L > nr. 



It follows that, under this condition, the function u = 2 %f is a solution of (11). 
Example 4. We shall now describe a class of radially symmetric tower solutions. Let 
0 = R 0 < Ri < • • • < R p < R p + 1 = Too, and let := B Ri \B Ri _ 1 ,i = 1, . . . ,p+ 1 
(where we set B 0 — 0, i? +00 = M 2 ). Let also ai, . . . , a p +i be real numbers such 
that 1 = 0 and ^ R;+i, for i = l,...,p. Then, there exist real numbers 




The Total Variation Flow 



235 



&i, . . . , 6p+i, with 6 P+ 1 = 0 and 6* ± 6»+i, such that sign(5 m -6*) = sign(a* + i -a*) 
for any i £ {1, . . . ,p} (i.e. the numbers bi have the same “qualitative ordering” 
of ai) and the function u = Y^=i b iXsii ls a solution of (11). Indeed, applying 
Theorem 4.6 to the sets f one can check that the function u is a solution of (11) 
when 

b , = ? 

sign(a* - ai-i)Ri-i + sign(a* + i - ai)Ri ' 

Notice that, since Ri > Ri- 1 , it follows sign(6*) = -sign(a* +i — a*). 

This example has already been discussed in [14]. 

6. Solutions of the denoising problem 

In this last section, we will show how to obtain explicit solutions of the denoising 
problem (4), starting from solutions of (11), see [5, Proposition 7, Proposition 8]. 

Proposition 6 . 1 . Let A > 0, b £ R and u £ BV(M?) a solution of (11). Then 
the function v = au is the solution of the variational problem (4) with a := 
sign(fe) [(|fo| — A) V 0] and f := bu. Conversely , if v = au £ BV(M?) is the so- 
lution of (4) with f = bu, for some a, b £ R with b — a = ±A, then the function u 
is a solution of ( 11). 

The following proposition, which extends the previous result, clarifies how to 
construct solutions of (4) from tower solutions of (11). 

Proposition 6.2. Let Co, C \, . . . , C m C R 2 be open sets with boundary of class 
C 1,1 , such that Ci fl Cj =0 for i ^ j, (JHo ^ = an d Ci i s bounded for any 
i £ {1, ... , m}. Assume that the function u = Y^lLi c iXCi is & solution of (11), for 
some Ci £ M. Let now bi £ R, i £ {1, . . . , m}, A > 0 and ai = sign(6*)[(|&i| — A) VO]. 
Then the function u = YJiLi a i c iXCi is the solution of the variational problem (4) 
with f = Y!ill b i c iXC l • 
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A Mathematical Model for 
Diffusion-induced Grain Boundary Motion 

Oliver Penrose and J.W. Cahn 



Abstract. For a film of metallic alloy immersed in a suitable vapour, we use 
a system of four coupled nonlinear differential equations to model the steady 
diffusion-induced motion of: a grain boundary, the surfaces of the two grains, 
and the triple junction where they all meet. One of the equations models 
diffusion along the moving grain boundary; another models the force balance 
which determines its speed. The remaining two equations model diffusion in 
the surfaces of the two grains. The equations are linked by boundary condi- 
tions at the triple junction. The resulting system of differential equations and 
boundary conditions is solved here for the case of ‘trailing’ grain boundaries 
(ones where the growing crystal grain develops as a layer beneath the surface 
of the specimen rather than filling up the entire space between the two sur- 
faces) in a limit where the elastic driving force is very small. The main result is 
that for small values of Ac, defined as the (experimentally controllable) jump 
in mole fraction of solute at the triple junction, the growth velocity of the 
trailing grain is approximately proportional to (Ac) 4 , but for large positive 
Ac the velocity is approximately proportional to (Ac) 5 . The thickness of the 
trailing grain is approximately proportional to (Ac ) -2 for small Ac and to 
(Ac ) -8 ^ 3 for large. There is a negative value of Ac beyond which the model 
predicts that the velocity and thickness are independent of Ac, but this result 
should be treated with caution because the solution may be unstable. 



1. Introduction 

Diffusion-induced grain boundary motion (DIGM) was first observed by Hillert 
and Purdy in 1978 [9]. A thin foil of iron was placed in a vapour of zinc atoms. 
The zinc diffused along the boundaries between the different crystal grains in the 
foil, but not into the grains themselves, where the diffusivity is much lower. Parts of 
these grain boundaries were then observed to move, the grain on one side growing 
while the grain on the other side shrank. The zinc atoms which had diffused in 
along the grain boundary were left behind in the newly grown part of the crystal, 
thereby alloying it. Hillert and Purdy also demonstrated that DIGM occurred for 
de-alloying by placing a foil of an iron-zinc alloy in a vacuum. DIGM has been 
observed in many other alloy systems and with many geometries; for a review see 
ref. [7] . 
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There has been controversy in the metallurgical community about the force 
that drives DIGM. With a view to helping resolve the controversy, Cahn et al [2] 
put forward a phase-field model 1 in which the consequences of different assump- 
tions about the driving force could be worked out and compared with experiment. 
It was concluded that the most likely candidate was a mechanism suggested (in 
a slightly different context) by Sulonen [14] and first studied quantitatively by 
Hillert [8] ; this mechanism is based on the energy of elastic deformations caused by 
mismatch between the sizes of the two different types of atom 2 . It remains the only 
mechanism consistent with all experiments and with some insightful Gedanken ex- 
periments. 

The analysis given in ref. [2] is based on the assumption that the grain bound- 
ary is flat, but real grain boundaries are not flat: they can curve in the middle 
while still connecting the two sides of the foil, or they can curve so much that 
they separate into two parts, one close to each face of the foil. Both types of 
grain boundary were observed in the original experiments [9]. We shall call them 
connecting and trailing grain boundaries; they are illustrated in Fig. 1. 

To model these curved grain boundaries, Fife et al. [5] developed the mathe- 
matics of the phase-field model further, deriving a sharp-interface approximation 
which led to a system of differential equations for the shape of the grain boundary 
(represented in this approximation as a geometrical surface) . Some numerical stud- 
ies of both phase-field and sharp-interface models were described at this conference 
by Vanessa Styles (see also ref. [6]). 



2. The equations for the grain boundary 

The sharp-interface equations obtained by Fife et al. [5] can also be derived by 
physical arguments which do not rely on the phase field model [1,3]. There are two 
differential equations. One of them, describing diffusion within the grain boundary, 
can be written (in a specially chosen system of units, v.i.) 

d?c h 

—j^r = (Cgb(s) - c a (s))v cos 0 (-±7T < 6 < ±n) (1) 

Here c g b denotes the local mole fraction of solute within the advancing grain boun- 
dary, c a denotes the local mole fraction just in front of the advancing grain bound- 
ary, s denotes arc length measured along the grain boundary, v is the (positive) 
velocity of the grain boundary in the positive y direction (the y axis being taken 
parallel to the faces of the foil, as shown in Fig. 1) and 6 is the angle between the 
grain boundary and the positive x-axis, which is taken to be perpendicular to the 
face of the foil. The local mole fraction in the grain left behind by the advancing 
grain boundary is equal to c g b . 



1 For some existence and uniqueness results about the equations of this model, see ref. [4]. 

2 For a detailed analysis of this interaction, arising out of discussions at the conference between 
J. Ockendon and one of the authors, see ref. [13]. 
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Figure 1 . The two types of steadily moving grain boundary and 
the notation used in this paper. In this paper we solve the trailing 
case. Arc length in the grain boundary is denoted by s; in the solid- 
vapour surface, by a (with a > 0 in the shrinking grain, a < 0 
in the growing grain) . The angle between the grain boundary and 
the x-axis is denoted by 8 ; the angle between the solid- vapour 
interface and the y- axis, by </>. 



Usually c a is equal to Co, the concentration in the shrinking grain well ahead of 
the moving grain boundary, but near the surface there can be a thin zone in which 
the metal in front of the grain boundary arrived there as a result of deposition 
from the vapour; in this zone, c a is approximately equal to c eq , the concentration 
of solute at a flat interface in equilibrium with the vapour. Let x m be the smallest 
local minimum value of x reached by the surface of the shrinking grain at points 
other than the triple junction, and let x tj be the x-coordinate of the triple junction. 
If x m > x tj (as is the case in Fig. 1(b)) then the zone, if it exists, does not meet the 
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grain boundary, but if x m < x tj then its inner boundary meets the grain boundary 
at a point whose x-coordinate is x m . The rule determining c a can be written 

f c° if 0<*<min(w«), 

Ca \ c eq if x m < x < x tj . 

The second differential equation, expressing the local normal velocity v cos 6 
of the grain boundary in terms of the elastic driving force and the surface tension 
force due to the curvature of the grain boundary, is 

v cos 6 = d6/ds + e(c g b — c a ) 2 if t < 6 < ^7r, (3) 

where 

e := Yrj 2 DX/M^f 2 (4) 

is a dimensionless constant measuring the strength of the elastic interaction driving 
the DIGM. In this definition, Y is a certain elastic modulus, 77 is the rate of change 
of stress-free lattice spacing with respect to changes in mole fraction, D is the 
diffusivity in the grain boundary, A is its thickness, M is its mobility (velocity per 
unit driving pressure) and 7 is its energy per unit area. 

The units of measurement have been chosen so that DX = 1 and also M7 = 
1. In a general system of units, s would be replaced by (DX/M^y)s and v by 
[(M'y) 2 /DX]v, but the formula (4) would not change. Although the values of both 
D and M are not known with any accuracy for real metals and are expected to vary 
greatly with composition and with temperature (as an Arrhenius exponential of the 
reciprocal of temperature) only their ratio, which is less sensitive to temperature, 
appears in (4); hence the order of magnitude of e can be estimated from theory 
or experiment. Nevertheless, since rj can vary widely (from near zero in silver-gold 
to 0.15 in nickel-gold) we expect e also to vary widely. In the present paper we 
investigate the behaviour of this mathematical model in the limit of very small e. 

Eqns (l)-(3) are given here only in the form they take when — ^ir < 6 < 
on the whole of the grain boundary. For values of 0 outside this range the equations 
are different, reflecting the possibility of the grain boundary’s passing a point in 
the metal more than once; this generalization is considered in section 8. 

The solutions of eqns (1) and (3) satisfy boundary conditions at each end of 
the curve. One of these ends is the triple junction shown in Fig. 1, at which the 
grain boundary meets the external surfaces of the two crystal grains. For connect- 
ing grain boundaries, the other end is a similar triple junction at the corresponding 
point on the other side of the mid-plane of the foil. Assuming that the grain boun- 
dary is symmetrical about this plane, the concentration will be an even function 
of displacement from this plane and the angle will be an odd function, so that we 
need consider one half of the grain boundary only, with suitable conditions at the 
mid-point, which we also take as the origin of the arc length coordinate s: 

6 = 0 
dcgb jds = 0 
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For trailing grain boundaries, there is no symmetry, but instead there are condi- 
tions in the far distance behind the triple junction, namely 

<•> 

where the location of the point at which s = 0 is now arbitrary. 

In addition to the condition (5) or (6) there are two conditions at the triple 
junction itself. These conditions are the same for both cases. One of them (arising 
from the requirement that the chemical potentials must be continuous at the triple 
junction) can be written 

C tj = c eq (?) 

where c tj denotes the value of c g b at the triple junction and c eq is defined above. 
By varying the partial pressure of solute atoms in the vapour, the experimenter 
can control the value of c eq and hence that of c tj . The second boundary condition 
concerns the direction of the tangent to the grain boundary at the triple junction, 
i.e.j the value of 0 tj := 9(s tj ) where s tj denotes the value of s at the triple junction. 
In refs [2, 5] the simplest possible approximation was used, namely that the grain 
boundary is perpendicular to the surface (i.e. 6 tj = 0), but this approximation can 
obscure some interesting effects. To complete the specification of the boundary 
conditions, we need a condition from which 8 tj can be determined; this condition 
is considered in the following section. 



3. The Mullins theory for surface motion by surface diffusion 

In the presence of a grain boundary the surface of the specimen is not flat because 
there is a groove at the triple junction, where the angles are determined by the 
surface tensions (free energies per unit area) of the three interfaces meeting there. 
The resulting curvature affects the chemical potential at the surface, and gradients 
in this chemical potential cause matter to diffuse along the surface so that it is 
deposited in some places and removed from others. 

A theory of this surface motion by diffusion in the case of a pure substance 
was developed by Mullins [10]. According to his ideas, the chemical potential varies 
linearly with the curvature, the material flux along the surface is proportional to 
the gradient of the chemical potential, and the deposition rate on the surface is 
minus the surface divergence of the flux; the resulting local normal velocity of 
the surface is therefore proportional to minus the surface Laplacian of surface 
curvature, in our case d 3 (j)/dcr 3 where a is arc length along the surface and (j) is the 
angle between the surface and the y axis. If the surface profile of the metal moves 
like a wave travelling in the y direction with velocity v, this local deposition rate 
is —vsin </>, and so (j) satisfies the differential equation 

b d 3 (f) / dcr 3 = v sin (f) (8) 

where b is a dimensionless material constant. In a general system of units, b would 
be replaced by the material constant B used by Mullins, which has dimensions 
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(length) 4 (time) -1 and is related to b by the formula B = b(DX) 2 /(M7). Eq 
(8), which was given by Kanel, Novick-Cohen and Vilenkin [12], is the nonlinear 
generalization of an equation formulated by Mullins [11], which in our notation 
would read bd 4 x/dy A — vdxjdy. 

In ref. [11] Mullins found that the end of the moving grain boundary would 
make a fixed angle with the x axis, independent of the speed v (though it would 
change sign if the sign of v were reversed). His result can be written 

= -k 5 ( 9 ) 

where 5 is the discontinuity in 0, which is determined by the surface tension 
equilibrium at the triple junction. One of the results of the present paper is a 
generalization of (9), eqn (35) below. The extra term on the left side of (35) arises 
from the nonlinearity of eqn (8). The right side of (35) would be replaced by zero 
for the problems considered by Mullins and by Kanel et al.; this term arises from 
a further effect which we now consider. 

Eqn (8) was formulated for a pure substance in equilibrium with its vapour. 
Here, however, we are concerned with an alloy whose concentration may vary 
along the surface. Furthermore the formulation of the boundary condition at the 
triple junction has to be amended to take account of flow into and out of the 
grain boundary. The relevant generalization of Mullins’ theory has been considered 
by Brener and Temkin [1], who (using the same linearization approximation as 
Mullins [11]) obtain a pair of equations for the fluxes of both types of atom along 
the surface. The present paper applies to an alloy (e.g. iron-zinc) where the solute 
(zinc) is volatile while the solvent (iron) can be assumed non-volatile, in which 
case only one equation is necessary; for a travelling wave the equation is [3] 

b(l — c eq ) d 3 4> / dcr 3 = (1 — c a )v sin (10) 

where c a means essentially the same thing as in eqn (1), i.e. c a = Co at points 
which the surface is passing for the first time, and c a = c eq if the surface has been 
there before. In the rest of this paper we confine ourselves to the case where c eq 
and Co are close enough together to justify treating the factors 1 — c eq and 1 — c^ e 
as equal, so that (8) can be used in place of (10). 

The boundary conditions on (8) at infinity are, naturally, 

c/)(a) — > 0 as a — > ±00 (11) 

At the triple junction (where we take a — 0) the function 4>(cr) is discontinuous. 
The angles there are related to each other and to the angle at the end of the grain 
boundary by 

<M+o) - <A(-o) = <5 

</>(+ 0) + 4 >(— 0) = — 2 @tj (12) 

where in the second line we have assumed that the surface tensions of the two 
external surfaces are equal, so that at the triple junction the tangent to the grain 
boundary bisects the angle between the tangents to the two external surfaces. A 
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second condition comes from requiring the chemical potential of the solute to be 
continuous at the triple junction; since this chemical potential depends on the 
curvature, the condition is 

</>'(+ 0) = ^(-0) (13) 

where the prime attached to </> denotes a derivative. 

Finally, there is a condition arising from the fact that matter is conserved at 
the triple junction, so that the algebraic sum of the flows of solvent in the three 
surfaces meeting there is zero. Since the flows in the outer surfaces are proportional 
to the gradient of the curvature (j> f while that in the grain boundary is proportional 
to the gradient of the concentration there, the conservation condition can be shown 
[3] to take the form 

<A"(+0) -</>"(-()) = 6-V tr (14) 

where c' tj denotes the value of dcg^/ds at the outer end of the grain boundary, 
i.e. at the triple junction. In a general system of units the coefficient b~ x would 
be replaced by M'y/DXb = DX/B where B is the material constant mentioned 
earlier, which is defined in Mullins’ papers [10, 11]. 

For the problem considered in refs [11, 12], there is no flow of matter along 
the grain boundary, and so the right side of (14) is replaced there by 0. 

4. Analysis of the nonlinear surface diffusion equation 

In the phase space of the differential equation (8) there is a fixed point at 
(</>,<//,</>") = (0,0,0). Near this fixed point the equation can be approximated as 
the linear system (d/dcr)[</>, </>', </>"] T = A[(j), 0', </>"] T , where T denotes a transpose 
and A is the matrix 

0 10“ 

0 0 1 (15) 

v/b 0 0 

The expanding subspace of this fixed point is one-dimensional and is spanned by 
the vector (l,a,o; 2 ) T where a := (v/fr) 1 / 3 , so that on it we have <j)' = a4>,(j)'' = 
a 2 (j). The corresponding expanding manifold can be (locally) parametrized by the 
variable (f ) : in some neighbourhood of the fixed point the other two coordinates can 
be expressed as power series in </> whose linear terms correspond to the linearized 
solution, namely 

(j) = a(j)[l — + 628680 ^ “ 6521926320 ^ +"•] (16) 

4>" = aV[l-i<A 2 + ^ 4 - 40^95^ + ...] (17) 

These series were computed by requiring them to satisfy the equations 6' d(b' / dd> = 
0", W7# = a 3 sin0 

The contracting subspace of the fixed point is two-dimensional, being spanned 
by the real and imaginary parts of the vector (1, aou, a 2 cu 2 ) where u is either of 
the complex cube roots of 1. Since 1 + uj + u 2 = 0, any vector (</>, <//, </>") in the 
subspace satisfies a 2 (j) + cap' + 0" = 0. The corresponding contracting manifold 
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can be locally parametrized by the pair </>, ft] in some neighbourhood of the fixed 
point the remaining coordinate ft' can be expressed as a power series in these 
parameters whose linear term corresponds to the linearized solution: 



b" = a + I -15_p2 a 2 , J_ 3 _ 125 663 5 

v p q T 273 p -r 182 P H T 91 pq ' Qi H 73162635^ 



■ 12793 4 , 36256 3 2 , 3495 2 3 , 5961 4 , 184 5 , 

' 39020072 -^ ^ 14632527 ^ * ^ 1393574 ^ ^ ^ 4877509 *'** ^ 696787 ** 

where p := a 2 (j),q := aft. This series was computed by requiring it to satisfy 
ft dft' /d(j) + ft'dft' /dft = a 3 sin </>. 



5. The matching condition at the triple junction 

For the shrinking grain, a is positive and the function ftcr) satisfies the boundary 
condition (11) for a — > +oo; it therefore belongs to the contracting manifold and 
satisfies (18). For the growing grain, a is negative and the function ftcr) satisfies 
the boundary condition (11) for a — > — oo; it therefore belongs to the expanding 



manifold and satisfies (16), (17). The two second derivatives in 


(14) 


can therefore 


be written 

shrinking grain : 


</>"(+ 0) = -a 2 <t>{+0)-a4>'(+0) + ... 


by 


(18) 




= —a 2 (p(+0) - a(f>'{- 0) + . . . 


by 


(13) (19) 

(16) Uyj 




= — a 2 <^>(+0) — a 2 4>{— 0) + . . . 


by 


growing grain : 


0"(-O) = a 2 </>(-0) + . . . 


by 


(17) 



so that (14) itself becomes 

b- l c' tj = 4>"(+o) — </>"(— o) 

= -a 2 <p(+ 0) - 2c*y (-0) + . . . (20) 

Using (12), which implies r/>(±0) = — 6 tj ± 7 , 5 . we can write (20) in the form 

b~ l a~ 2 c' tj = 3 e ti + \8 + mp tj ,$) (21) 

where the higher terms in the series are incorporated in the function 

H(e,5) := -§ 0 3 + ±0*5 - ^06 2 - 2 ^ 3 <5 3 + 0(\e\ 5 + |< 5 | 5 ) ( 22 ) 



6. Solving the equations 

Only trailing grain boundaries will be studied in this paper. The connecting grain 
boundaries will be considered in a forthcoming paper [3]. To simplify the notation 
we write c(s) or c in place of c g b{s) from here on. 

The problem we consider is to find the speed v of the trailing grain boundary 
for a given value of the experimentally controllable parameter Ac := c eq — Co, 
which lies in the interval [—1, +1]. Our method is to find a family of solutions for 
eqns (1) and (3) satisfying the boundary condition (6), treating v as a parameter. 
Then we pick out the solution and the value of v so that the boundary conditions 
(7) and (21) are satisfied. 
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We shall make the approximation of assuming that x tj — x m , if positive, is 
negligibly small, so that (2) simplifies to c a = Co; this assumption will be justified 
a posteriori — see the discussion of eqn (40). It is convenient also to introduce the 
new variable 0 := — 6, so that eqns (1), (3) become 

d 2 c/ds 2 = (c — c o)fsin0 (23) 

di^/ds = e(c — Cq ) 2 — vsinip if 0 < 0 < n) (24) 



with the boundary conditions c — > Co, dc/ds — » 0, 0 — > 0 as s — > — oo. In the range 
where 0 < 0 < 7r, the order of the system can be reduced to two by going over to 
0 as independent variable: 



(e(c — Cq ) 2 — v sin0) 



90 



(e(c- c 0 ) 2 -vsinip)^ 



= (c — co)t’sin0 (25) 



with the boundary conditions c — > Co, [c(c — Co) 2 — v sin0]dc/d0 — > 0 as 0 — > 0. 

The solutions of eqn (25) depend on the material parameter e as well as on 
the experimentally controllable parameter Ac. We shall look for a linkage between 
these two parameters which leads to a family of solutions which are asymptotically 
self-similar in the limit e — » 0. To do this, we seek solutions whose asymptotic 
behaviour in this limit is 



c(0) - Co ~ e m c(0) 
v ~ e n v 



as 



(26) 



where m and n are constants, to be chosen later so that a useful self-similar family 
is obtained. To begin the process of choosing m and n, let us require 



2m + 1 < n; (27) 

then it follows that vsin 0 becomes negligible in comparison with ec 2 as e — > 0, so 
that (25) simplifies to 

e2+4m ^ ( g2 ^) +°( 1 )) = e ^ sin ^ < 28 ) 

with the boundary conditions c —> 0 ,c 2 dc/d0 — ► 0 as 0 — > 0. To obtain a useful 
result, we require 

2 + 4m = n (29) 

so that eqn (28) simplifies further to 

£ ^( £2 $) =i)sin ''' <3o) 

The solution of this equation satisfying the boundary conditions at 0 = 0 
can be computed as a series or by numerical integration. The series solution is 

« = ± (^) : 1/1 ^ /4 [l - A* 2 + + 3 + ■ ■ • 1 PI) 
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To apply the concentration condition at the triple junction, eqn (7), we first 
evaluate (31) at the triple junction, obtaining 

&t3 = ± (ijf ) ^ 4 ^ “ 4?2 + • ■ ■ ] ( 32 ) 

where ip tj — 6 tj is the value of V’ at the triple junction and c tj is defined, in 

analogy with (26), by c tj — Co = e m c tj . Then (7) becomes c eq — Co = e rn c tj , i.e. 

Ac = c m c ti . (33) 



To apply the triple junction angle condition (21), we need a formula for dcjds\ 
eqns (24), (26) and (31) give the asymptotic result 



dc 

ds 



He -Co ) 2 



„ sinV ,]^ e 3m+l a 2^ 

d'lp dip 



3e 3m+l / 160 \ 3 / 4 



15 



) + 4 ) 



x [1 - -85_,/, 2 + 6305 /,4 7198697 / 6 , 1 ('■m 

A l i 1416 ^ ' 4010112 ^ 357734071296 ^ \°*) 

The sign in (34) is the same as that of c — Co- When (34) is substituted into the 
triple junction angle condition (21), using also (26), the result is 

W tj + + H(6 tj ,5) = b-\v/b)- 2 /*c' tj 



= ) 3/ " 4 ,4 [i - + • • ■ i(i + o(Di ( 35 ) 

To make eqn (35) approach a useful limit as e — » 0, we require the right side 
to be formally independent of e, i.e., 

—2n/3 + 3m + 1 = 0 (36) 

Then (35) simplifies (after dividing by 3 and taking the limit e — > 0) to 

e„ + is + iH(e„,s) 

= ±+ (+ «‘ /12 4 /4 [l (37) 

where the sign is the same as that of c tj . The solution of eqns (29), (36) is m — 
l,n = 6, and these numbers also satisfy (27). Thus we can now be more specific 
about the ansatz (26) and the formula (33), replacing them by 

~ } as e-0 

V ~ € V J 

Ac = ec tj (38) 



From these formulas we can obtain information about the sizes of the struc- 
tures under study. The growing crystal grain forms a thin layer under the surface 
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of the specimen. Its thickness is the sum of two parts; one part is the depth of its 
inner surface below the triple junction, given by 



hx := 




cos 6 ds — 




sin ip dip 

e(c — Co) 2 — vsimp 



1 f^ tj simp dip 

e 3 Jo c 2 

1/15 \ 1//2 f^ tj simp dip 

e 3 \16fi/ J 0 ^ 3/2 [l - 



by (24) 
by (38) 



(39) 



by (31). The other contribution is the depth of the triple junction below the surface 
of the shrinking grain, given by 



~(b/v) f 0 ^ d?4>/ da 3 da 


by 


(8) 


-{b/vWi- o) 






~(b/v)(a 2 4>(- 0) + ...) 


by 


(19) 


(b/v) 1 ^(6 tj + + . . . ) 


by 


(12) 


c- 2 (&/fi) 1/3 (0 y + !* + ... 


) by 


(38) 



(40) 



Since e is small, the contribution hi dominates, and the thickness of the growing 
layer is of order e -3 . Obviously this must be less than the overall thickness of the 
specimen, so it is a necessary condition for the applicability of these results that 
the thickness of the specimen should be of order e ~ 3 or greater. If the specimen 
is thinner than this, then DIGM is possible, if at all, only for connecting grain 
boundaries, which are outside the scope of this paper. 

Formula (40) shows that the depth of the triple junction below the surface 
of the growing crystal is of order e~ 2 . The sizes of other features of the outer 
boundary such as the depth of the triple junction below the surface of the growing 
crystal, the difference in thickness of the two crystals, and the thickness of the zone 
comprising the parts of the shrinking crystal that are deposited from the vapour 
as the grain boundary advances, are likewise of order e -2 . From this it follows that 
x tj — x m — 0(e~ 2 ) < x tj — 0(e~ 3 ), justifying our neglect of x tj — x m in setting 
up eqn (23). 



7. The speed of DIGM 

To complete the calculation we want to find v, which determines the speed of the 
grain boundary and enters into the formula (31) for c(ip) from which the shape 
and size of the grain boundary can be calculated. We treat c tj as known (from (4) 
and (38)), so it is a question of solving the pair of equations (32) and (37) for the 
two unknowns v and ip tj . A reasonably accurate method of solution is first to take 
the cube root of (32) and eliminate v 1//12 between the resulting equation and (37) 
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to obtain 

(i^yi/3 = (41) 

Ytj 

where 

« ■■= ik < 42 > 

[1 _ _5_W,2 , 11/3 

■= [\* + -ip tj ,5) Y u 4? 85 J / ? , r ( 43 ) 

i 1 1416 + * ' • J 

As 'i/itj increases over its allowed range [0, tt] at fixed 5 the right side of (41) 
decreases monotonically from +oo to a negative value F(ir, <5) /7r. Eqn (41) therefore 
has a unique solution for i/; tj provided that ( Kc tj ) > — 8)/n\ 3 . The value 

of \F(tt,S)/tt\ 3 varies from (0.671 ... ) 3 = 0.302... at S = 0 to (0.181...) 3 = 
0.0059 ... at S = 7r. 

An estimate of the value of v for a given c tj can be derived from the following 
(unproved) bounds, which were obtained from graphs of F($ tj ,$) — jjrJ against i/; tj 
for various 5: 

0.35 - 0.8^. < F(il> tj ,5) - ±6 < 1.5 - 0.8^ ti (44) 

Using these in (41) and solving for we find 



JVi + ±5 

Vtj 0.8 + (Kc^) 1 / 3 



(45) 



where 0.35 < N± < 1.5. 

Solving (32) for v and then using (45) together with the fact that 0.903 < 
[1 - ^2 + ...]< 1, we obtain 



i.e. v 



16-0 3 






-4 



15cJ + 

~W 1, iv, + is ) Nl 

15Af 2 eAc 4 ( 0.8c 1 / 3 + (KAc) 1 ^ 3 



16 



Nx + U 



(l + o(l)) 



(46) 



where 1 < < (0.903) _1 = 1.504.... Thus, provided always that Kc > 

— |F(ir, (5)/7r| 3 , the speed of trailing DIGM is approximately quadratic in e and 
proportional to the fourth power of the applied concentration difference Ac when 
Ac is small (compared with (0.8) 3 e/if), but linear in e and proportional to the 
fifth power of Ac, when Ac is large and positive. 

Besides the speed, we can also calculate the thickness of the trailing layer. 
To leading order, the thickness is given by (39) as v~ x ! 2 times an increasing 
function of ^ which is approximately proportional to at small %p tj and 
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approaches a constant as ip tj — > n. Consequently, by (45) and (46), the thick- 
ness is approximately proportional to e -3 c“ 2 = e _1 (Ac)~ 2 at small Ac, and to 

e -3 c t ~ 5 / 2 c t "~ 1//6 = e -3 c t ~ 8 / 3 = e -1 ^ 3 (Ac) -8 / 3 at large Ac. 



8. Generalizing the equations of the model 



To treat the case Kc < — |F(7r, <5)/7 t| 3 we need to generalize the fundamental 
equations of our model to values of # outside the range — < 9 < used 

in (1) and (3). The new possibility is that cos 6 may be negative, describing a 
section of grain boundary along which s increases to the left instead of the right, 
so that the moving grain boundary may be traversing material points that it has 
already passed before. The formulation of the equations for such points does not 
involve any new (local) physics, just rewriting the old equations in a more general 
notation. 

For points on the grain boundary where cos# ^ 0, let us define a local unit 
vector n which is normal to the grain boundary and makes an acute angle with 
the positive y- axis. Because we are assuming a travelling wave, the local direction 
of motion of the grain boundary is in the direction of n, and its normal velocity 
in this direction is the positive quantity v \ cos#|. The extension of (1) to general 
values of 9 is therefore 

= (c 9 b(s) - c 0 (s) )t>| COS 0|, (47) 

where c a , as before, represents the mole fraction c just in front of the moving grain 
boundary. If the relevant material point has not been passed before by the grain 
boundary, c a is given by eqn (2); otherwise it is equal to the value of c g b at the 
same value of x for the most recent previous passing, that is c a (x,y) = c g b(x,yi) 
for the smallest yi > y at which (x,yi) is on a grain boundary. Eqn (47) can 
also be used at the places where cos# = 0 since the normal velocity there is zero 
regardless of the direction chosen for the unit normal vector. 

The generalized force balance equation says that the normal velocity v\ cos#| 
equals the sum of the curvature force and the elastic driving force, both reckoned as 
positive if they are in the direction of the above unit normal vector. The curvature 
force is d9/ds in the direction of the more usual normal vector, which is on the 
left of a person walking along the curve in the direction of increasing s. This more 
usual normal vector is equal to ilS(#), where 




if cos # > 0 
if cos # < 0. 



(48) 



Therefore the curvature force in the direction of n is S(9)d9/ds. The elastic driving 
force always acts in the direction of motion, and is therefore positive under our 
sign convention and is given by the same expression as in (3). Combining the three 
terms, we obtain the following generalization of the force balance equation (3) 



'u | cos # | = S(9)d9/ds + e(c g b(s) — c a (s)) 2 if cos# ^ 0. (49) 
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Here c a , as in (47), denotes the concentration just in front of the advancing grain 
boundary. An equivalent form for (49), which we shall refer to later, is 

d6/ds = vcos 6 — e(c g b(s) — c a (s)) 2 S(0) if cos# ^ 0. (50) 

Unlike (47), eqn (49) cannot be used at the places where cos# = 0; at such 
places the normal velocity of the grain boundary is zero and so the elastic force on 
the grain boundary is not given by the formula e(c gb (s) — c a (s)) 2 , which applies 
only when the normal velocity is non-zero. Nevertheless, if we include the natural 
stipulation that (so as to exclude kinks) # must vary continuously with s, eqn 
(49) does make a clear prediction about what happens at places where cos # = 0, 
provided also that sin# < 0 there. To obtain this prediction, note that eqn (49) 
implies, whenever cos# ^ 0, 

S(0)4- cos 9 
ds 

— sin#{ , u| cos#| - e(c gb - c a ) 2 } by (49) 
v\ cos#| + esm0(cg b — c a ) 2 (51) 

If we define f(s) := e~ vs \ cos#(s)|, it follows that df /ds < 0 whenever cos# ^ 0 
and sin# < 0. Now let So be some value of s at which # = r. At this value we 
have f(s) = 0 and sin#(s) < 0, and it follows 3 from the non-positivity of df /ds 
that f(s) = 0 for all s > so, and hence that # remains at the value — for all 
s > s 0 . 

This is a puzzling result. The model predicts that the grain boundary can 
have a straight part, as illustrated in Fig. 2, on which there is no normal velocity 
and no curvature; yet the mole fractions at nearby points on the two sides of 
the straight part are different, which leads one to expect an elastic force on it, 
in violation of the force balance principle used to derive (49). Mathematically, 
the difficulty is related to the singular character of the differential equation (50), 
whose right side is not a continuous function of s as cos # passes through the value 
— ^7r. The way round the paradox may be connected with stability; for although 
the result shows that the straight section of the grain boundary is stable against 
small perturbations within the class of travelling wave solutions, it does not prove 
stability against perturbations which are outside this class. The theory of these 
more general perturbations, in which the shape of the curve in Fig. 2 would depend 
on time, is outside the scope of this paper; all we do here is to work out the 
travelling wave solution from which such a stability analysis would start. 



ds 



cos# | 



< 



3 If there were an 52 > so with /(S 2 ) > 0, then by the continuity of 0(s) there would be an s i 
satisfying so < si < S 2 with 0 = /(so) < /(s i) < /(S 2 ), but this last inequality is incompatible 
with the non-positivity of df /ds when cos# ^ 0. 
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9. Predictions of the model for the case Kc < <5)/7t| 3 

The result just derived implies that the grain boundary consists of two distinct 
parts, as illustrated in Fig.2 : a curved inner part on which 6 > r, smoothly 
joined to a straight outer part, parallel to the faces of the crystal, on which 
0 = — 17 r. The outer straight part would be a solution to (50) if the last term were 
set to 0 by assumption when cos 8 = 0. 




FIGURE 2. Sketch of the grain boundary when Ac < —(c/K\F(7r,5)/ir\ 3 



On the straight part, eqn (47) is simply 

d 2 c/ds 2 = 0 . 



(52) 



At the smooth join c, dc/ds and ijj are continuous; let us denote their values there 
by c sj ,c f sj and ^ sj . Then eqn (52) tells us that 

c(s) = c 3j + {s- S sj )c' sj 

Ctj Csj “t” (.S i,j S s j')C s j 

C' tj = c'^0 (53) 



For the curved part of the grain boundary the method of solving the equations 
is nearly the same as in section 6; the only difference is that the places of c tj ,c' tj 
and 'iptj are now taken by c sj ,c' sj and = tt. The analogue of (41) can be written 

, 1 ( F(n,8) \ 3 _ 1 F(n,6) 3 

Csj K V 7T ) K 7T 



( 54 ) 
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Putting this into the analogue of the first line of (46), we obtain 

15c 4 , 



v = 



167T 3 [1 - jfjTT 2 + ... 14 



= 0.0455c? 



0.0455 / F(n, 8) 



12 



i.e. v 



K 4 \ 7T 

0.0455e 6 fF(n,5y 12 
K 4 



by (54) 

(l + o(l)) 



(55) 



Provided Ac is less than the critical value — (e/K)\F(Tr, <5)/7r| 3 , the velocity 
is independent of Ac : no matter how large and negative Ac is made, the velocity 
cannot be increased beyond the value given in (55). In a similar way it can be 
shown, using (39) and (40), that the thickness of the grain boundary is independent 
of Ac under these conditions. Increases of |Ac| beyond (e/K)\F(7r, 5)/7r| 3 serve only 
to increase the length of the straight part of the grain boundary, which according 
to (53) and (54) is the positive quantity 



(c 0 - c tj ) - (c 0 - c sj ) _ |Ac| - (e/K)\F{TT,5)/n\ 3 



(56) 



Apart from the number 0.0455, which comes from the solution of the grain 
boundary equations, the scaled speed v is determined entirely by the non-linear 
Mullins theory, depending only on the angle 5 at the triple junction and on the 
value of K = 4 /225b. However, although v is independent of c, it is extremely 
sensitive to the other parameters, particularly 5 : for example, when 5 = 0, the 
coefficient of K ~ 4 in equation (55) is 4 x 10 -4 , but when S = n this coefficient is 
7 x 10“ 11 (the smallness of these numbers is deceptive, since K ~ 4 could be large). 
Moreover, as we have mentioned earlier, the travelling wave solution underlying the 
derivation of the results in this section may well be unstable against perturbations 
which, if they move forward in the y direction at all, do so with speed less than v. 
The stability against such perturbations would presumably get worse as |Ac| was 
made larger, since any localized perturbation would (by (56)) have more time to 
grow before colliding with the triple junction. 
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Continuation of the Solution to the Chemotaxis 
Problem Beyond its Blow-up 



Mario Primicerio and Boris Zaltzman 



Abstract. In this paper we deal with classical solutions to the chemotaxis 
problem and characterize cases of blow-up. Next, we define continuation of 
the classical solutions beyond blow-up time To. This definition is in fact non- 
trivial because, although allowing finite mass to concentrate in the origin 
after T 0 (without prescribing its amount as a function of time), it identifies 
the solution uniquely. Moreover, two ways of regularizing the problem are 
provided: they look quite natural and they are shown to converge to the 
solution. 



1. Introduction 

Blow-up of solutions to the system of partial differential equations modelling 
chemotaxis has been investigated recently by several authors (see e.g. [1], [2] ). But 
also in the case of the basic Keller-Segel model ([3]) some problems have received 
only partial answer. Referring to that model and confining to a two-dimensional 
radial-symmetric problem in an infinite domain, we will try to contribute to a 
better understanding of this mathematical phenomenon. 

In all radial-symmetric problems, the Keller-Segel system can be reduced to 
a single equation for a suitable mass function; this equation shows clearly that the 
phenomenon is the combination of diffusion and nonlinear convection. When the 
latter prevails, blow-up occurs, whereas when diffusion provides sufficiently strong 
dissipation, no singularity appears for the concentration. 

This balance is dependent on the number of dimensions, so that in one space 
dimension there is no blow-up and eventually the equation is a Burger’s equation. 
In three dimensions blow-up occurs for any size of initial datum, while in the 
two-dimensional case we will see that either the diffusion term or the nonlinear 
convection can be dominant, depending on the size of initial datum. In case of 
blow-up (which will occur in the center for symmetry reasons) some additional 
questions arise. Will all the mass initially present concentrate instantaneously in 
a ’’delta function” in the origin? And if this is not the case, is there any way of 
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continuing the solution after blow-up? Moreover, can a regularization be performed 
so that this continuation can be actually computed? 

In Section 2 of this paper we deal with classical solutions and characterize 
cases of blow-up by a comparison technique using as a barrier the only nontrivial 
asymptotic solution. In Section 3 we define continuation of the classical solutions 
beyond blow-up time To. This definition, although analogous to the already used 
definition of weak solution, is in fact non-trivial because, although allowing finite 
mass to concentrate in the origin after To (without prescribing its amount as 
a function of time), it identifies the solution uniquely. Moreover, two ways of 
regularizing the problem are provided: they look quite natural and they are shown 
to converge to the solution. 



2. Classical solution 

The two-dimensional radial-symmetric problem we will consider is the following, 
ru t = ( vru r — rup r ) r , r > 0, t > 0, (1) 

( rp r ) T = -ru, r > 0, t > 0, (2) 

with the initial condition 

u(r, 0) = u 0 (r), r > 0, (3) 

where uo(r) will be assumed such that lim r _, 00 uo(r)r 2+a = 0, for some a > 0, so 
that the total initial mass M = 2tt J 0 °° ruo(r)dr is finite. 



We define the mass function SDt(r, t) as 

r r 

3Jl(r, t) d = 2n / pu(p, t)dp. 

Jo 

Then (1), (2) reduces to the single equation 

r fat 



Tit = vr 



TlWtr 

+ — , r > 0, t > 0. 



2n r 



Next, define 



r ‘2 rV2s 

s = — , M(s,t) = DJl(y/2s,t) = 2n pu(p,t)dp. 
2 Jo 

Then, (5) becomes 



and (3) gives 



M t = 2 vsM ss + — - MM S , s > 0, t > 0, 
27T 



rV2s 

M(s, 0) = M 0 (s) = 27 r / pu 0 (p)dp , 

Jo 



( 4 ) 

( 5 ) 

( 6 ) 

( 7 ) 

(8) 



so that lim^oo Mq(s) = M. Our first aim is to discuss the solvability of the 
following 
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Problem 2.1. Given a bounded nondecreasing function M 0 (s) G C 1 ([0, oo)), 

M 0 ( 0) = 0, find T > 0 and a bounded function M(s,t) € (7([0, oo) x [0 , T)) D 
C 2,1 ((0,oo) x (0,T)), satisfying 

( M t = 2 vsM ss + -^M M s , s > 0, t G (0, T), 

< M(s,0) = M 0 (s), 5 > 0, (9) 

[ M(0,*)=0, *e(0,T). 

M(s,t ) will be called the classical solution to the problem and the interval (0,To), 
Tq = sup{T}, /im£e or infinite, will be called the maximal interval of existence. 

First we state the following 

Proposition 2.2. Any solution of Problem 2.1 is such that 

0 < M < M, >0, s > 0, te (0, Tq), (10) 

Proof A straightforward application of the maximum principle yields (10). □ 

Next, we prove the following result on monotone dependence upon the data. 

Lemma 2.3. Let M\, M 2 be classical solutions of (9) corresponding to the data 
Mio(s) and M 2 o(s) respectively, let Tqi and T 02 be their respective maximal inter- 
vals of existence, and let Tq = min (Tqi, T 02 ). Then, if 



Mio(a) < M2o{s), s > 0, 


(11) 


Mi (s,t) < M 2 (s,t), s > 0, t e (0,T 0 ). 


(12) 



Proof Consider 

V(s,t) = M 2 (s,t) - Mi(s,t), s > 0, te( 0,T o ), 

and note that 

Vt = 2 vsV,, + Ml f M2 V s + - (M 1 + M 2 ) s , s>0, «€ (0,To). (13) 

47 T S 

Note that V is nonnegative for t — 0, s > 0 because of (11) and vanishes on 
5 = 0. Hence (12) follows by maximum principle (consider e.g. an approximation 
V £ whose data are V £ (s , 0) = V ( s , 0) +£, F £ (0, t) = e and prove that V £ can never 
reach the minimum value e/2). □ 

Essentially by the same arguments, we can prove 

Lemma 2.4. Let M solve Problem 2.1. Then the following implications hold: 

Mq(s) <0 in =» M 8S (s,t) < 0 in i? + x (0,T), (14) 

47ti/sMq(s) +M 0 (s)M'(s) > 0 in R + M t (s,t) > 0 in R + x (0,T), (15) 

Investigating the stationary solutions to (7), we obtain the following 
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Proposition 2.5. Equation (7) admits nonconstant smooth stationary solutions 
M(s) of finite mass M only if M — Stti /. M(s) has the form 

M(s) (16) 

for arbitrary (3 > 0. 



We also have 



Proposition 2.6. Let 

poo 

Moo(t) = 2tt / ru(r,t)dr — lim M(s,t). (17) 

Jo s ^°° 

Then it is 

Moo(t) - Moo(0) = M, Vt G (0,T o ). (18) 

Now, we state and prove the main result of this section. 

Theorem 2.7. If 

M < 8nv, (19) 

then Problem 2.1 is uniquely solvable and To = oo. 

If, on the contrary , 

M > 87 xv, (20) 

then Problem 2.1 is uniquely solvable, Tq is finite and 

lim M(s,t) > 87 rv, Vs G R + . (21) 

t—+T 0 — 

Proof Local existence is rather standard. Uniqueness for a given T is a corollary of 
Lemma 2.3. We also note that as long as lim s _+o+, M(s, t) = 0 (i.e. as long as 7/(0, t) 

t^T- 

is finite) the solution can be continued beyond T. First, we prove Theorem when 
M < 8ixv. Indeed, the stationary solution (16) is uniformly and monotonically 
convergent to 8irv in any closed subset of R + when f3 — > 0. Hence, for sufficiently 
small /?, Mq(s) < M(s). Therefore, Lemma 2.3 enables us to claim that 

M(s, t) < M(s) in R + x (0, T) (22) 

and thus the solution can be continued beyond any T > 0. Next, we prove that 
the same is true also when M = 87 tv. Let 



and define 



u = max(87rz/, maxMn(s)), 
[ 0 , 1 ] uv 



J us , s G [0, 8tti//u\, 
[ 87TIS, s G (87TZ/, + 00 ] . 



(23) 

(24) 



If T, v(s, t) is the solution of Problem 2.1 with initial datum vo(s), Lemma 2.3 
ensures that M(s,t ) < v(s,t ) in their common interval of existence and that 
Tq > T. So, to complete the proof of the first statement of the theorem we only 




Continuation of the Solution Beyond its Blow-up 



259 



need to prove that T = +oo. To this end, let e > 0 and let P £ be the solution of 
the following problem 



Pet — ‘IvePzss + — s > t E (T/2,T), 

Pe(s,T/ 2) = v(s,T/ 2), s > 0, P e (0,t) = 0,t€ (f/2,f). 



Since v ss < 0 in R + x (0,T) (recall (14) after proper smoothing of vo(s)), we have 
v(s,t) < P e (s,t) in R+ x (f/2,f). (25) 

Consider the nonlinear wave propagation equation 

Pt = in R+ x (T/2,T), 

P(s,T/ 2) = v(s,T/2), s > 0, P e (0,t) = 0,t€ (T/2,T). 



The problem is solvable, since v'(s,T/ 2) > 0, and 

P(s,t) < 8 tti/, in R+ x (f/2,f). (26) 



Using (25) and its limit for e — > 0 and (26) we have that, for any t G (T/2,T), it 
is v(s,t) < 87n/, s > 0, and taking i as a new initial time we are back to the case 
studied above. To conclude the proof of the theorem we have to consider the case 
in which (20) holds. Define 




r 3 u(r,t)dr , t E (0,T). 



(27) 



Multiply equations (1) and (2) by r 3 and integrate over i? + . After integration by 
parts we find 

M — 

^t = — (S^-M), t E (0,Tq), (28) 

where (18) has been taken into account. Since G(t) > 0 in (0 ,Tq), it is clear that 



2ttG(0) 

— M ) ’ 



(29) 



thus excluding global existence of classical solutions when (20) holds. To prove 
(21), assume that for a given s o > 0, there exists 6 > 0 such that 



M(so,To) = lim M(so, t) = Snu — 26. (30) 

t—*To — 

Therefore, for some a > 0, M(so,t) < 87 w — 6, t E (T 0 — a, To), and, hence, (recall 
M s > 0) 

M(s, T 0 — a) < 87 tv — 5, s E [0, so] • (31) 

But this would imply the possibility of defining M(s) as in (16) with sufficiently 
small (3 > 0 in such a way that 

M(s,T 0 -(t) <M{s ), seR+. 



(32) 
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By Lemma 2.3 

Af(s,T 0 ) <M(s) (33) 

and thus we find that contradicting (21) would imply that (0 ,Tq) is not the max- 
imal interval of existence of classical solution. □ 

3. Global in time solutions 

We state 

Problem 3.1. Given a bounded nondecreasing function Mq(s) £ C 1 ([0,oo)) with 
Mo(0) = 0 ; such that M > 87n/ ? let Tq be the maximal interval of existence of 
the classical solution. We look for a bounded nondecreasing function M defined on 
x (0, oo) and such that for all T > 0 

Mt = 2vsM ss + M s , s > 0, t £ (0,T), 

M(s,0) = M 0 (s), s>0, 

"«>•'> = { >°i, Itn, 

M(s,t) will be called global-in-time solution to our problem. 

Remark 3.2. The global solution is a “gluing” of a smooth classical solution and 
a singular (at r = 0) “weak” solution at the blow-up instant t = Tq. 

As a preliminary, we note that classical results on parabolic equations yield 

Proposition 3.3. For any global-in-time solution we have M £ C°°(R+ x (o ,T)). 

Let us prove that blow-up singularities never “melt” , in the sense that 



Lemma 3.4. For any weak solution we have 

M(s,t) > 87 xv for all s > 0, t> Tq. (35) 

Proof. Using Proposition 3.3 we find 

M t = 2 usM ss + MM S , s > 0, t > T 0 , (36) 

27 r 

M(s,Tq) > 87 tv, s > 0, (37) 

M(0,t) >0, t >T 0 . (38) 

Applying Lemma 2.3, we obtain that 

Af (s, t) > M(s) in R+ x (T 0 , 00) (39) 

where M(s) is any stationary solution defined by (16). Since the stationary solu- 
tions are uniformly and monotonically convergent to Situ in any closed subset of 
R + when (3 — > 0, (39) yields the estimate (35). □ 



Next, we prove uniqueness of the global-in-time solutions. 
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Theorem 3.5. Let Mi and M 2 be continuations of the same classical solution be- 
yond To. Then Mi = M 2 in R + x (Tq,T). 



Proof. Integrating the equation 



{sM s 



1_ 

2v 



M t - — [M 2 - SnuM ) , 



(40) 



(41) 



in s and £, we find for any global-in-time solution 

sM s e L°°(0, oo;L 1 (0,T)) for all T > 0. 

Define N(s,t ) = Mi(s,t) — M 2 (s, t). Because of Proposition 3.3, 

N t = 2 u{sN s ) s + f (. N(M ! +M 2 - 8 t ri/)) a 

47 r 

is satisfied in R + x (To,T). Moreover, N(s,To) = 0 in R+ . Multiply (41) by 
f 1 (N(s,t )) , where the functions / 7 (iV) are smooth approximations of sgn(N) : 

= 7>0 ' (42) 

Then, for any d > 0, integrate on (J, -boo). After integration by parts the following 
inequality is obtained 

d sN 2 r°° 

- / ds < 2 1 / [-s 2 / 7 1V s ] \. =s + 2u / f-yNds— (43) 

Js v N 1 +7 Js 

1 1 /’ 00 

[JV/, (JW, + M 2 - 8?n/)] |„ f - — / N(M, + M 2 - Swv) ^ 



a 6 s «i S - 



r 

7 0 



iV (Mi + M 2 — 8ttu) f 1 ds. 



Now, integrate in £, take the limit 7 — ► 0, let 5 tend to zero and apply (40) to 
obtain 

/‘OO -j /*oo 

/ s|W(s,t)|ds<-— / liVKAfi+Ma-lfiTTi/)^, fe(T 0 ,T), (44) 

Jo 47r Jo 

and making use of Lemma 3.3 completes the proof. □ 

To prove the existence, we will use a monotonicity argument based on two 
different types of regularization. 

(i) First, we regularize the Keller-Segel model as we did in [4], [5], i.e. by 
assuming that the coefficient of chemotactic response vanishes when the concen- 
tration of the cells exceeds a threshold value. The regularized problem takes the 
form 



ru £ t = ( uru £ - H ( - - u £ ) ru £ p £ r ) , r > 0, t > 0 



i r Pr) r = ~ ru£ » V > t > 0 , 

u £ (r, 0) = uo(r), r > 0. 



(45) 

(46) 

(47) 
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In (45), H is the Heaviside graph. 

(ii) Alternatively, regularization will be based on the assumption that the 
coefficient of chemotactic response vanishes for r < e : 

ru £ t = ( isru £ — H (r — e) ru £ p £ ) r , r > 0, t > 0, (48) 

( r Pr) r = - rU< % r > t > °, (49) 



u £ (r, 0) =uo(r), r > 0. (50) 

In both cases, by sending e to zero we will obtain the global-in-time solution 
to the original problem. 

We start with approach (i). Through the same steps used to obtain (7), we 
get for £ < £ 0 = 27r/ sup Mg 



Ml = 2 vsMl + 
M e (s, 0) = Mo(s), s> 0. 




M £ Mg, s > 0, t > 0, 



(51) 

(52) 



The cutting effect on the nonlinear term provides the existence of a unique global 
solution (48), (49) such that M £ (0,t) = 0, t > 0. 

Next, we prove 



Lemma 3.6. Assume 



Then 



M" <0, s > 0. 



(53) 



M £ s <0, a.e. s > 0, t > 0. 

Moreover, for two different initial data Mio, M 20 satisfying (53) it is 
M w (s) > M 20 (s) =>• (s, t) > M|(s,t). 



(54) 

(55) 



Proof We apply the argument of Lemma 2.4 in the domain where Mf < 2 / n/e. 
To prove (55), note that when (54) holds then 



27T 



H I Mf = H 1 

e / V £ 



= h( 



M t 



(56) 



Then , we approximate H in L 2 by smooth monotonic functions H n and we are 
reduced to prove (55) for the corresponding smooth solutions Mf , Mf. But this 
follows from the maximum principle applied to the difference N = Mf — Mf, 
N(s,0) > 0 that satisfies 

N t = 2 vuN ss + T H W M fN s + ^-H^MI s N+^ Hn (v)NMIM! s , s>0,t>0, 

(57) 



where 
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Now we consider the following nonlinear wave propagation problem 

P t = PPs, in i? + xi?+, (58) 

P(s,0) = M 0 (s), 5 > 0. (59) 

Note that P(s, t ) is the limit of the solutions to the following family of problems 

for Burgers equations 

Put = \ Pnss + PnPns in R + X R + , (60) 

P n (s,0) = M 0 (s), s > 0, P n (0,t) = 0, t > 0. (61) 

We can prove the following comparison lemma 
Lemma 3.7. If (53) holds , then 

M £ (s, t) < P(s,t), s > 0, t > 0. (62) 

We also have 

«). = Ml + ± ( m t - ±HM £ M ! ) , 
and hence, for any T > 0 

+ < 63 > 

and condition (i) in Problem 3.1 is fulfilled uniformly with respect to e > 0. 

Now, we are able to prove 

Theorem 3.8. Assume (53) holds. Then the limit of the solutions of (51)— (52) as e 
tends to zero coincides with the classical solution in (0,Tq) and provides its unique 
continuation beyond To. 

Proof. Because of (56), H is nonincreasing both with respect to £ and to M, so 
that 

£i < £2 => M £l < M £ 2 in x P + . (64) 

From (64) we have that there exists a function p(x,t) such that 

/x(s,t) = lim M £ (s,t), s > 0, t > 0. (65) 

£—►0 

It is easy to check that fi fulfills the conditions in the statement of Problem 3.1 
and that it is smooth in i? + x i? + . Next, compare M £ with the (classical) solution 
M(s,t) of Problem 2.1 in its maximal interval of existence. Noting that H < 1, 
we have M e (s, t) < M(s, £), s > 0, t E (0, To) and passing to the limit as e [ 0 

/i(s,t) < M(s,t), s > 0, t e (0,T 0 ). (66) 

But this means that fi{ 0, t) = 0 in (0, To) and hence (To, p) solve Problem 2.1 and, 
since the classical solution is unique, 

p(s,t) — M(s,£), s > 0, t e (0,T 0 ). 



(67) 
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The use of the uniqueness beyond the blow-up (Theorem 3.5) completes the proof. 

□ 



Now we consider the regularization (ii) with the aim of avoiding assumption 
(52). We denote by M e the solution of 

Ml = 2 vsMl + Tff (s - y) M e Ml s > 0, t > 0, (68) 

M E (s, 0) = Mq(s), s> 0, and M e (0,t) =0, t > 0. (69) 

We proceed as in proving Lemmas 3.5 and 3.7 and obtain the uniform estimates 



Lemma 3.9. The approximating solutions are bounded from below by the diffusion- 
dominated problem and from above by the nonlinear wave propagation problem: 

Q(sff) < M £ (sff) < P(sff), s > 0, t > 0. (70) 

Moreover M £ depends monotonically on the initial datum. 



Proof. No major changes are needed in the proofs of Lemmas 3.5 and 3.7 to obtain 
the above statement. Let us remark that in the present case we do need assumption 
(53), since the “cutting” in (68) does not depend on M £ . □ 

Theorem 3.10. The limit of the solutions to the approximating problems (68)-(69) 
is the global-in-time solution to Problem 3.1. 

Proof. The proof is a slight modification of the argument of Theorem 3.8, and it 
is based on the use of the monotonic dependence of the solutions on £ > 0. □ 
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Shape Deformations and Analytic Continuation 
in Free Boundary Problems 

Fernando Reitich 



Abstract. In this paper we present a general approach to the analysis of sta- 
bility and bifurcation of steady states and travelling waves for a class of free- 
boundary problems. The approach is based on the explicit consideration of 
series solutions in a parameter measuring the variations of the domains (e.g. 
an initial configuration or a specific steady state) from separable geometries 
(e.g. planes, spheres, etc). The method relies on the derivation of formulae for 
the recursive evaluation of (Taylor) approximations of arbitrary order in the 
variation parameters, and on the iterative estimation of the growth, in appro- 
priately defined spaces, of the resulting functional coefficients. We describe 
a variety of instances where this procedure leads to general results on ana- 
lyticity of solutions, on existence of complex steady- state and travelling- wave 
solutions and on nonlinear stability of equilibria. We further show that the 
relevance of these results goes beyond the theoretical. Actually, it also leads 
to an understanding of the mechanisms behind the observed performance of 
a class of numerical algorithms, based on shape-perturbation theory, which 
have been extensively used in simulations of free-boundary (and boundary 
value) models. Indeed, our research shows that the standard implementations 
of these schemes can suffer from severe ill-conditioning as a result of pro- 
nounced cancellations in the underlying recursions. Moreover, our work fur- 
ther suggests alternative implementations with greatly improved properties 
of numerical stability and convergence, enhanced by methods of analytic con- 
tinuation. As we demonstrate, these modifications can have a dramatic effect 
on the accuracy and applicability of perturbative numerical approaches. 



1. Introduction 

The issues of stability and bifurcation of steady states or other distinguished solu- 
tions (e.g. travelling waves) of free-boundary models are of central importance in 
applications, both in process design (e.g. solidification) as well as in their control 
(e.g. tumor growth). A classical approach to these issues is based on the analysis of 
linearized equations , leading to “linear stability analysis” or to “bifurcation equa- 
tions” . Of course, these analyses are only suggestive of the behavior of solutions, 
as any definitive assertion would entail the study of the full non-linear system. 
These latter studies, on the other hand, are significantly more complex than their 
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linearized counterparts, and they typically involve a completely different approach. 
In particular, nonlinear analyses seldom shed direct light on the order and nature 
of possible instabilities and, in addition, their highly abstract nature may render 
them inadequate for numerical simulations. In contrast, linearized approaches can 
clearly identify first order instabilities, and they also deliver linear systems whose 
simulation can be readily implemented. A most natural question then relates to the 
possibility of upgrading classical linear analyses to full nonlinear studies, by itera- 
tively computing and analyzing higher-order corrections to the linearized model. 
In this paper we tackle this very question with a view to providing a general, direct 
setting for the theoretical and numerical analysis (stability, bifurcation) of steady 
states and travelling waves of free-boundary models. 

A most interesting aspect of our program is our demonstration that a direct 
approach to analyzing higher-order corrections to the linearized models will typi- 
cally fail. Indeed, the higher order systems obviously correspond to inhomogeneous 
instances of the linearized equations, where the source terms depend on solutions 
to lower-order systems. Unfortunately, as we show in the next section, these source 
terms arise as sums that entail significant cancellations of singularities , as the most 
singular part of each term in the sum exactly cancels out to yield an appropria- 
tely regular right-hand-side. This simple fact has rather dramatic consequences, 
both in theory and simulation. On the theoretical side, these cancellations pre- 
clude a straightforward iterative estimation of the higher-order corrections, as any 
use of the triangle inequality destroys these, rendering a useless bound (see §2). 
With regards to numerical simulations, on the other hand, the cancellations can 
be identified with the source of numerical instabilities that have been observed in 
standard algorithms based of shape deformations [3, 4, 5, 11, 12, 16, 17]; see §4. 

As we have found, however, these theoretical and numerical impediments 
can be overcome. Indeed, as we show in §3, a simple change of independent vari- 
ables that collapses the perturbed domains onto the unperturbed geometry has 
the beneficial effect of implicitly accounting for cancellations. This observation 
then can be used to advantage in both theory and practice. In the theory, the 
lack of significant cancellations in the new variables allows for a direct estimation 
of the higher-order terms in the perturbation expansion; and, from a practical 
standpoint, this realization suggests a number of alternative implementations of 
perturbative schemes that exhibit enhanced stability and convergence properties. 
The rest of the paper is devoted to substantiating these assertions. First, in §2, 
we provide a description of the origins and nature of the cancellations that arise 
as we consider higher-order terms in the perturbation series of a field quantity 
with respect to variations of its domain of definition. There, we further show how 
these cancellations conspire against a direct estimation of the terms in the series. 
Then, in §3, we briefly explain how cancellations are mollified by a change of vari- 
ables, thereby allowing for iterative estimates. We close, finally, with a description, 
in §4, of some algorithmic improvements on numerical methods based on shape 
perturbations that are motivated by the theory. 
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2. Cancellations in higher-order corrections 

As will be clear from the presentation, our observations, results and numerical 
algorithms will be relevant to rather general free-boundary problems, just as first- 
order linearized stability analyses are. Specific examples, in d spatial dimensions 



(d = 2 , 3 ), include the classical (e.g. quasi-steady, one-phase) Stefan problem [8] 


AC = 0 


in Q(t) C R d , t > 0 




K--V 

o — v n 

on 


on dQ,(t), t > 0 


(1) 


C = 7ft 


on d£l(t), t > 0 




for the evolution of a particle growing by diffusion or heat flow; a model for a 
capillary drop [ 9 , 10 ] 


o 

II 

> 

1 

<1 


in fl(£), t > 0 




div(i?) = 0 


in fi(t), t > 0 




(Vu + (Vu) T — pi ) n — — yftn 


on dfi(t), t > 0 


(2) 


V n — u • n 


on dQ(t), t > 0; 




models of tumor growth [6, 7 ] of the form 






do 

c— = A a — Xo 
dt 


in fi(t), t > 0 




lb" 

1 

1 

II 

< 


in fi(t), t > 0 




0 = 0 


on dCl(t), t > 0 


( 3 ) 


f*--V 

r\ y n 

on 


on dQ(t), t > 0 




p = 7ft 


on < 9 H(£), t > 0; 




and the classical model of gravity-capillary water waves [ 13 ] 




A <p(x,y,t) = 0 
d y <p(x , — h,t ) = 0 


for x G \ —h < y < r)(x,t) 




1 2 

dt<P +2 l v ^l +OT = 


on y — 


( 4 ) 


- d t r] - V x r] ■ V x ip + dytp = 0 


on y = r)(x, t) 




to describe but a few. In all cases, linearized stability analyses are rather straight- 
forward. For instance, the version of ( 1 ) linearized about the spherical steady state 


fi(i) = 5 d_1 = {lj <= R d : 


M = l}, C(x,t) = 7 


( 5 ) 
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reads 



ACi =0 


in 


|rc| < 1 




dCy dA, 
dr dt 


on 


gd-i 


( 6 ) 


Ci + 7 ^Ai + ^ ^ A^Ai^ — 0 


on 


gd-i 




> 

IT 

II 

e 


on 


S d-i 





where (C i,Ai) denote the first order corrections to the steady state (5). More 
precisely, 



C = 7 + eCi(x, t) and £l(t) 



= G 



M = l + eAi(o;,t), ^ = t —A (7) 



solve ( 1 ) with the initial conditions 




x eR d 



\x\ = 1 + eA(u;), cu = 




(8) 



to first order in e. From ( 6 ), a simple calculation reveals the (linear) stability of 
all perturbations except for those with the least oscillations (Fourier /Spherical 
harmonics of order zero and one). 

As we mentioned in the Introduction, we wish to capitalize on the simplicity 
of the first-order, linear stability analysis through a simple extension to higher 
orders in £, in the hope of attaining a direct approach to non-linear stability. 
More precisely, in keeping with example (1), we seek a solution to ( 1 ), ( 8 ) in the 
form 



C = C(x,t;e) =7 + £ n > 1 C'nOM)e" and (9) 

ft(£) = jze]R d : |z| = 1 + A{uj,t\e) = 1 + £„>i A n (u:,t)e n , u = |f|-| . 

As easily verified, the n-th order correction (C n , A n ) for n > 2 solves a problem of 
the form 



o 

II 

<1 


in 


M < 1 




dC n i dA n ^ 
dr + dt ~° n 


on 


s d ~ 1 


( 10 ) 


C n y ^A n + ^ ^ A n ^ — G n 


on 


s d ~ 1 




A n (w, 0 ) = 0 


on 


s d ~ 1 
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where the functions G\ and G 2 n are given as sums of terms that depend upon 
{[Ck}kZ i Symbolically, equations (10) can be written as 

L (C n , A n ) = 

= f2*n({C k } n k Zl{^k} n k Zl) (11) 

n= 1 

where each \I>™ is simply a monomial in the functions (^{Ck}^Zl , At 

this stage, a natural approach suggests itself whereby the functions (C ni A n ) are 
estimated inductively using (11). The complexity of the right-hand-sides in this 
equation (whose number of terms K n increases with n), however, conspires against 
the use of any bound other than the triangle inequality 

Kn / x 

II (C n , A n ) |U < II ({CfclfclJ , {Afc}^ 1 ) ||„, (12) 

771=1 

where the norms are appropriately defined. As we mentioned, the sum in (11) 
entails significant cancellations among its constitutive terms, which are destroyed 
upon use of (12). As a result, the bound obtained in this manner does not guarantee 
the convergence of the series in (9). 

In fact, this observation is not related to the free-boundary character of the 
problems: an entirely analogous situation arises on an attempt at solving a boun- 
dary value problem via a boundary perturbation approach. To see this, consider 
the simple problem 

A C = 0 in Q £ = [x E M 2 : \x\ = r < 1 + £A(0)} (13) 

c = C(r,6)\ r=1+sm = m for 9 e [0,2 tt] 

and propose a solution C = C(r,Q\e) as in (9) 

C = CoM) + £;C n (r,0)e n (14) 

n> 1 

where now Co(r, 6) is the solution of the unperturbed problem (13) with e — 0. 
More precisely, if 

M 

m= E fp eipe > ( 15 ) 

p= — M 

we have 

M 

C 0 {r, 9) = E frr' p 'e ip9 - 

p= — M 



( 16 ) 
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The analog of (10) is then 

A C n = 0 in fi 0 = {x £ M 2 • \x\ < 1} 

Cn(l,9) = 8 nfl m - J2 Y^dr l Ci(l,e) for 6 e [0,24 (17) 

The cancellations in formula (17) are now rather apparent: for instance, the n-th 
order formula entails increasingly high-order derivatives of the corrections Ci and, 
therefore, high-order derivatives of the perturbation A (6) (upon iteration). On the 
other hand, classical results on analyticity of potentials, such as those that can be 
derived from the work of Calderon [1] and Coifman and Meyer [2] on the theory 
of singular integrals, guarantee that the series (14) converges, for r < 1 and e 
sufficiently small, for perturbations A (6) that are only Lipschitz continuous. This 
apparent contradiction between the high-order of differentiations in (17) and the 
low degree of regularity necessary for the convergence of the series can only be 
reconciled if these high-order derivatives actually cancel out in (17), yielding an 
appropriately regular right-hand side [13, 14]. 

A particularly clear verification of this assertion, which also shows how the 
bound (12) fails to accurately estimate the functions C n , can be derived if we 
consider equation (17) in Fourier space. Indeed, letting 

(n+l)M 

C n (r,0)= J2 d n, P r {p{ e ipe 

p= — (n+l)M 



and 



A (6) k 



k\ ~ ^2 ^ k >p e 

p= — kM 

the recurrence (17) can be written as 

n— 1 

dn,p ~ dn,ofp ^ ^ ^ ^ A n -l,p- 
1=0 q 

In particular, for p = (n + 1)M we have 

n— 1 



M! 



(\q\ — iji — 1))\ 



} di :q . 



Dn — d n Q f m ^ ^ A, 



((/ + 1)M)\ 



ho An ~ l ’ (n ~ l)M {(l + 1 )M - (n - l))\ Dl 



(18) 



(19) 



where we have set 



Dk — dfc,(fc+l)M- 

In Figure 1, we show some numerical results obtained from implementations of 
(19) in the most favorable case of an analytic boundary 



A (6) = 2 cos(0) = e i0 + e~ ie (M = 1). 
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Figure 1(a) shows the effect of cancellations on the accuracy that can be attained: 
at sufficiently high-orders, round-off errors completely overcome the values of in- 
terest. These errors, of course, arise from computing smaller values (i.e. the values 
of D n ) as differences of larger numbers (i.e. the individual terms in the right-hand 
side of (19)). The fact that these large numbers do indeed significantly cancel out is 
further demonstrated in Figure 1(b), which also exemplifies the effect of estimates 
of the form (12). For this, we introduce the majorizing sequence A n , defined by 
the recurrence 

A„ = M/„| + £ |A-..| n -OMl( (r+ ( |± 1 _ 1 “ ) l - 0) 1 A l - (20) 



Clearly, 



\D n \ < A, 



as (20) can be obtained from (19) upon use of the triangle inequality. As we show 
in Figure 1(b), this straightforward estimate, which destroys the cancellations in 
(19), leads to a series with a vanishing radius of convergence. 




Figure 1. (a) Results of the implementation of the recursion 
(19) in exact and finite precision arithmetic, (b) Root test for the 
sequences D n and A n in (19), (20). 



3. Recursive estimation of higher-order corrections 

As we stated in the Introduction, these cancellations can be avoided with a change 
of independent variables that maps the perturbed domains onto the unperturbed 
geometry. For instance, for the geometry of (4) a natural transformation is 



( 21 ) 
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similar transformations can be derived for other arrangements, such as those aris- 
ing in connection with problems (l)-(3). As it can be easily verified, the recurrences 
that arise when proposing solutions in the form of perturbation series for these 
problems in the new variables preserve the structure of that obtained in the origi- 
nal variables. However, substantial differences appear in the right-hand sides, most 
significantly the absence of high-order derivatives of lower-order terms (which, in 
turn, would entail high-order derivatives of the data and the boundary itself). 

For example, let us consider the travelling wave solutions 

<p(x + c t,y), r](x + c t) 

of (4): we are thus led to the system 

A <p(x,y) = 0 
d y (p(x , —h) = 0 

1 2 

[c • V x ] (p + - |Vy?| +gri = 

- [c • V x ] t] - V x ri ■ V x (p + dytp = 0 
which obviously supports the trivial solutions 

<p = 0, Tj = 0, c = c 0 . 

A natural approach to the derivation of nontrivial solutions to (22) is then to seek 
these in the form of perturbation series 

<p(x,y,e) = ^2<Pn(x,y)e n , ri(x-,e) = ’^2 l t] n (x)e n and (23) 

n> 1 n>l 

c(e) = c 0 + ^2 c n s n (24) 

n> 1 

that bifurcate from the trivial solution. In these original variables, the equations 
for the n-th order correction c n _i) take on the form 

A (f n (x,y) = 0 for x £ M d_1 , — h < y < 0 

dy<p n (x,- 1)=0 (25) 

[co * VJ <Pn{x, 0) + [g - -fA x ] T] n (x) + [c n _i • V x ] ipi{x, 0)(1 - <J n> i) = P n (x) 

- [co • Vx\rin(x) + dy(pn(x, 0) - [c n _i • S7 X ] rji(x)(l - 5 Ui i) = Q n {x) 

where, as in (10) and (17), the inhomogeneous terms involve derivatives of lower 
order corrections of increasingly high order. In contrast, if we seek a solution in 
the form 

y'-, e) = Yl y')e n , v(x'; e) = '%2 r )n{x')e n , 

n> 1 n> 1 



for x £ —h < y < r)(x) 

on y — rj(x) (22) 

on y = g(x) 



( 26 ) 
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the analog of (25) reads 

A Mx',y') = F n {x',y') = V X ' • FfaW) + d y 'F 2 n {x' ,y') + F*(x',y') 
d V '<p n (x',- 1) = 0 (27) 

[Co * (Pn (x 5 0 ) 4- Xln{x ) 4“ [c n _ i * Vx'] ^Pl (x , 0) (1 5 n , 1 ) Rn {x ) 

[C 0 ^x'] Xjn (yX ) “l - dy>(p„ {x ,0) [Cn— i * Vx'] T]l {x ) (1 S n> i ) S n {x ) 

where now the right-hand sides do not contain high-order derivatives of {ipk^Zi- 
For instance, 



n — 3 m n — 2 

*V) = EE — 2 ^ ^ ^Z + 1^7cc /( y^n— 1 — Z (1 y ) 

m = 0 1=0 1=0 

n — 3 m n — 2 

-EE (fin — 2 — — l T (f H” D ) ^ ^ dy'^Pn— 1 — l$y' ^n — 1 — + l 

m=0 1=0 1=0 

with similar expressions for F^, F^, F n and S n . At this stage, it can be shown that 
the absence of these high-order derivatives allows for the inductive estimation of 
the terms in the recurrence (27) in appropriate functional spaces. The underlying 
reason is now perhaps rather clear: the main difference between the approaches 
that deliver (25) and (27) resides in the boundary differentiations (with respect to 
e) that must be performed to derive them. Indeed, while for (25) these differenti- 
ations must be performed across the boundary of the domains of definition of the 
field quantities in order to justify the use of the “chain rule”, this justification is 
provided by the change of variables in the case of (27). In fact, the change of vari- 
ables guarantees that differentiations are always performed in the interior of the 
domain of definition of the fields. In this manner, the iterative use of recurrences 
such as that in (27) can lead to general results on non-linear stability of steady 
states for systems like ( 1 ) or ( 2 ), and to interesting new insights on bifurcation 
of solutions (steady states, travelling waves) for (3) and (4); for details, we refer 
to [ 6 , 7, 8 , 9, 10]. 



4. Numerical methods based on boundary variations: consequences 
and improved algorithms 

In this last section, we discuss the consequences of the cancellations described in 
§2 and the estimates discussed in §3 on the stability and convergence properties 
of numerical methods based on boundary perturbations. For the sake of brevity, 
we shall concentrate on the popular “Operator Expansion” method [3, 11, 12, 
16, 17], although entirely analogous analyses hold for most of the other classical 
implementations of these methods (see e.g. [13, 14, 15]). 
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The Operator Expansion (OE) method is an approach which has been shown 
to be well-suited to the theoretical and numerical analysis of rather general bound- 
ary-value and free-boundary problems, and specifically of those where the govern- 
ing equations in the “bulk” are relatively simple (e.g. systems of linear equations 
with constants coefficients). The basic strategy consists in posing the problem en- 
tirely in terms of unknowns defined on the (fixed or free) boundary of the domain 
of definition of the field quantities. In general, of course, these quantities will in- 
clude derivatives of the bulk fields at the boundary which, in turn, depend on 
their behavior inside their domain of definition. On the other hand, if the field 
equations are relatively simple (as is the case for a wide variety of boundary- value 
and free-boundary problems from mathematical physics), these derivatives can be 
obtained from suitably defined (non-local) boundary operators, thus reducing the 
overall approximation problem to a problem of approximation of these maps. To 
exemplify the details behind the procedure, let us consider a classical example of 
application, namely to the equations (4) modeling the evolution of gravity water 
waves (7 = 0). Clearly, letting 

£{x,t) = ip(x, rj(x , t ) , t) and 

v{x,t) = = -VxV(x,t) ■ V x <p(x,T}(x,t),t) + dy<p(x,r}(x,t),t), 

equations ( 4 ) can be entirely rewritten in terms of the quantities 77 ( 2 ;, £), £(x,t) 
and i/(x,t). In fact, introducing the Dirichlet-to-Neumann Operator (DNO) G(r /), 

G0?)[£] = v , (28) 

equations (4) are equivalent to the (Hamiltonian) system [18] 

9tV = 

dt£ = -6 V H, 



where the Hamiltonian H is given by 

= \ f $ G (vM] + 9 V 2 dx. 



Thus, from a numerical standpoint, the full problem is essentially reduced to fin- 
ding an efficient and accurate strategy for the approximation of the DNO in (28). 
In this connection, note that the operator can be explicitly found for a flat surface 
rf — 0. Indeed, for instance, if periodic two-dimensional solutions to (4) are to be 
found 

rj(x + 27 r, t) = r/(x, t), £(x + 27r, t) = £(x, t ) 



on an tank of infinite depth (h = oo), the operator on a flat surface is simply given 

by 

G'(o)E^ p i = Ei^ eip "- 

p p 



( 29 ) 
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In turn, this fact suggests a perturbation method to approximate the DNO on 
general surfaces 77 , wherein the operator is sought in the form of a series 

= (30) 

n> 0 

with G n (rj) = 0(r] n ). Letting D = — iV x , the ansatz in (30) readily leads to the 
recursion [14] 

71 1 yi l 

Gnivm = I Dr'D^rDZ - l^r'TTTTy (3!) 

1=0 ' 

which can be easily implemented in the frequency domain (cf. (29)). 

As in the case of (14), the convergence of the series (30) follows from the 
work in [1, 2] under the assumption that 77 is Lipschitz (and e.g. £ E H 1 ). On the 
other hand, as in (17), the right-hand sides of (31) involve high-order derivatives 
of the boundary 77 (and of the Dirichlet data £) which must, therefore, cancel out. 
As we explained in § 2 , this is a “recipe” for ill-conditioning which, in the present 
context, we further demonstrate in Figure 2. 

These figures display relative mean-squared errors in the Neumann data 
G(t 7 )[£] calculated for some specific solutions in a tank of unit depth (h — 1 ) 
where a formula similar to (31) holds [14]. Indeed, if h — 1, the functions 

Vp( x , V) = cosh(|p|(y + 1)) cos(p • x) (32) 

solve the first two equations in (4), and therefore 

G( 77 )[cosh(|p |(?7 + 1 )) cos (p ■ x)\ = (d y - V x r] ■ V x ) ¥> P \ y=ri (33) 

can be explicitly computed for any 77 and compared with the approximate nu- 
merical solutions. The errors in Figure 2(a)-(c) are displayed as functions of 77 , 
the number of terms retained in the Taylor series, and they correspond to three 
different profiles 77 with decreasing regularity: analytic (Figure 2(a)), four times 
continuously differentiable (Figure 2(b)) and Lipschitz continuous (Figure 2(c)). 
In addition to the OE results, we include the corresponding outcome for two addi- 
tional perturbative approximations, which we refer to as “Field Expansions” (FE) 
and “Transformed Field Expansion” (TFE). In contrast with the OE formulation, 
in the FE approach the DNO is not approximated directly as an operator, but 
rather a posteriori of a perturbation expansion of the fields themselves, as in (23). 
The TFE scheme, on the other hand, is similar to the FE procedure, but now the 
fields are expanded in the transformed variables, as in (26). 

The figure confirms the theoretical predictions of §§2 and 3. Indeed, the ef- 
fects of the ill-conditioning of the OE and FE formulae, arising as a result of 
cancellations, is evidenced here in the form of an explosive divergence of the series 
beyond a few terms. As the theory would predict, the onset of this divergence is 
precipitated by a profile’s roughness, as each term in the right-hand side of the 
recurrences becomes more singular with decreasing boundary regularity. Indeed, 
the figures show that as the profile is varied from smooth to rough to Lipschitz, 
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Figure 2. Relative L 2 error in the Neumann data (33), p = 
3, for various profiles y = r](x). (a) analytic boundary; (b) C 4 
boundary; (c) Lipschitz boundary. Figure (d) corresponds to the 
same profile as in (c) , and it exemplifies the existence of an optimal 
discretization that balances aliasing and ill-conditioning effects. 



the onset of divergence for the OE and FE methods changes from n — 9 to 8 to 
4 and from n — 20 to 13 to 5 respectively. In contrast, and again as expected, 
the results of TFE are consistently stable and only limited by the numerical res- 
olution (N y = number of discretization points in the ^-direction = 64, in these 
experiments). In all cases, a number N x = 256 of discretization points in the 
x-direction were used, which typically provided an optimal compromise between 
aliasing effects — for which N x large is preferred — and ill-conditioning — which 
accentuates with increasing N x . Indeed, a further refinement (to N x = 512), re- 
sults in a loss of accuracy in the OE and FE results for both the smooth and 
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rough profiles, indicating that conditioning errors overcome those that may arise 
from aliasing. For the Lipschitz profile, on the other hand, a discretization with 
N x = 512 actually produces slightly better results (for both OE and FE), but they 
again deteriorate at N x = 1024; see Figure 2(d). For comparison, this latter figure 
also includes the results of TFE with N x = 256 which demonstrate a further and 
unexpected advantage of TFE over the OE implementation, as the latter appears 
to contain more substantial information at high wavenumbers in each term of the 
perturbation series. Indeed, this figure shows that at a resolution of N x = 256 the 
calculation of terms beyond n — 5 with OE is significantly aliased, while that of 
TFE is not (as demonstrated by the higher resolution results of OE). 

The examples above clearly demonstrate the beneficial effect of the newly 
derived recursions underlying the TFE approach. However, these improvements 
come at a cost, namely increased computational effort. Indeed, the appearance of 
“volumetric sources” in the new recurrence (cf. the first equation in (27)) precludes 
the use of the exact solutions (32) as a basis for a spectral method, as it can be 
done within the OE and FE schemes. As it can be readily verified, the use of this 
basis translates into a reduction of dimensionality for the OE and FE procedures, 
as the dependence in the transverse direction y can be analytically derived; in 
contrast, this dependence must be numerically resolved within the TFE framework. 
As we have shown above (see also [14, 15]), however, this increased cost may be 
compensated by a substantial increase in accuracy that can, in fact, allow for 
computations beyond the reach of the OE and FE algorithms. 

Still, if we insist on computational times that are exactly comparable to those 
associated with the OE or FE methods, the theory we have presented also suggests 
another possible improvement. Indeed, as we mentioned (see the last paragraph in 
§3), the instabilities in the OE recursion (31) can be associated with the boundary 
differentiations that must be performed to derive it; the change of variables (21), 
on the other hand, guarantees that differentiations are performed in the interior 
of the domains. In turn, this suggests an alternative implementation of the OE 
method, whereby the DNO G(r]) is computed by analytic continuation through a 
family of operators G(erj) rather than by the more natural G(erj) as in (30). In 
accordance with the foregoing remarks, a good candidate for the family G(erj) is 
one of “Dirichlet-to-Interior Neumann Operators” (DIDO) defined as 

G( £ v)[£,](x) = (dy - eV x r] • V*) ip(x, 6(1 - e) + sr)(a :))) 

where b is to be chosen, and b > 0 so as to ensure interior differentiations. We 
conclude with an example of application of such a scheme. In Figure 3 we present 
a plot of maximum errors in the Neumann data (33) ( p = 5) for the case of a 
sinusoidal profile 

H 

V(x) = y cos(x) (34) 

with H = 0.1 and for several values of the parameter b. The significant gains in sta- 
bility of the DIDO approach over the standard OE method (b = 0) are evident in 
Figure 3(a), which again shows the error as a function of the number n of terms in 
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the Taylor series approximation. Indeed, the figure shows that this enhanced stabil- 
ity allows for the accurate calculation of the Taylor coefficients over a significantly 
larger range for n. And this, in turn, translates into substantial improvements 
in the accuracy of the calculations, particularly when appropriate re-summation 
(analytic continuation) mechanisms are introduced as in Figure 3(b). This last 
figure displays the corresponding errors in the approximations of the Taylor sum 
via (diagonal) Pade functions, showing that DIDO can deliver results that are 
several orders of magnitude better than those of classical OE at a comparable 
computational cost. 





Figure 3. L°° error in the Neumann data (33), p = 5, for the 
sinusoidal profile (34). (a) Taylor series approximation, (b) Ap- 
proximation via [n/2,n/2] Pade approximants. 
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Error Estimates for 
Dissipative Evolution Problems 

Giuseppe Savare 



Abstract. We present a quick overview of the general problem of finding opti- 
mal a priori and a posteriori error estimates for the approximation of dissipa- 
tive evolution equations in Hilbert and metric spaces by means of a variational 
formulation of the implicit Euler scheme. We shall discuss what are the in- 
trinsic metric arguments which are involved in the derivation of the estimates 
and we will present an elementary proof in a simplified finite dimensional case. 
An application to the porous medium equation in the new framework of the 
Wasserstein distance is briefly sketched. 



1. Introduction 

In this paper we are interested in optimal a priori and a posteriori error estimates 
for the approximation of the solution u : [0, T] — > y of the gradient flow equation 

r u '(i) = -v0(«(t)) te [o,ri, m 

\u(0) = u°, U 

by means of the implicit Euler scheme 

U *~ U * 1 = _ V0( ^n )) n = 1) ... >JV; given. (2) 

T n 

Here <j> : 5? — * M is a given functional defined in a suitable ambient space 5? , 
t := {T n }n=i denotes the variable step sizes induced by a partition V T of the time 
interval [0 ,T] into N subintervals := 

V T :={0 = t° r < ... <C =T}, r n = t™ — = |/”|, (3) 

and the equation (2) should be understood as a recursive algorithm which indicates 
how to find U ™ once U™~ 1 is known. 

The discrete solution U T (t := /7^,n = 1, • • • , N, will thus provide an ap- 
proximation of the exact solution u at the nodes of the partition V T . We want 
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to find estimates of the error E T between U T and the exact solution u 

E t := max d(U r (t), u(t)), d being a suitable distance defined on S? , 

addressing the following fundamental issues. 

• The estimates are a posteriori , i.e. they are expressed in terms of com- 
putable quantities which depend solely on the time-steps, the discrete 
solution, and the data. 

• The a posteriori bounds should converge to zero as the maximum of the 
time steps 

|r| := max r„ (4) 

l<n<N 

goes to 0 with an optimal rate w.r.t. the regularity of the solution u. 

• The estimates should depend on the structural properties ( dissipativity ) of 
5? and (j) which ensure the well-posedness of (1): in particular, they should 
not depend on the space dimension of 5? and they are explicitly deter- 
mined without need of solving any auxiliary problem. No extra regularity 
assumptions should be assumed. 

• No a priori constraints should be imposed between consecutive time- 
steps, which could just be tailored to the a posteriori error estimators 
alone. 

Four possible frameworks 

We will focus on frameworks where (1) is, in some sense to be better specified, 
dissipative and its dependence with respect to (w.r.t.) perturbations of the initial 
datum can be explicitly controlled in terms of the distance d. 

a) The simplest case: 5? := R m , d is the usual Euclidean distance, </> is a (qua- 
dratic perturbation of a) C 1 convex function: (1) is a system of ODE’s. 

b ) Infinite dimensional Hilbert spaces 5? := , the distance being induced by the 

scalar product (•,•). The most interesting case in view of applications to nonlinear 
evolutionary PDE’s [18] is provided by lower semicontinuous (l.s.c.) convex func- 
tionals (j) (continuous quadratic perturbations are still allowed), which take their 
values in the extended real line (— oo, +oo]. A crucial feature of the convexity of (j) is 
the variational formulation of both (1) and (2) by means of evolution inequalities. 
In fact, the solution of (1) can also be characterized by means of the system 

(u f (t),u(t) — v) + (f)(u(t )) < (f){v) Vu £ (5) 

which, at the discrete level, corresponds to 

- Ur 1 ), U?-V) + 4>(U?) < <KV) VF G (6) 

they both lead to the nonexpansivity property of the continuous (and discrete) 
trajectories starting from two initial values u°,v° (resp. 

d(u(t),v(t)) <d{u°,v°), d(U?,V?) <d(U°,V?) te [0,T], n = !,••• ,N. (7) 
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In the framework of this well developed theory (see e.g. [4]), the same approxima- 
tion results of case a) can be reproduced. We refer to [14] for a detailed account of 
the various contributions and for a presentation of the estimates and of the related 
applications; here we simply quote the optimal a priori estimates of [3], [15], and 
the linear theory of [8] (the last of an important series of papers on this subject). 

c) <5? is a Riemannian manifold, endowed with its Riemannian distance d. In this 
case equation (1) should be imposed in the tangent space of 5? at the point u(t) 
and we could interpret the notion of convexity as convexity along geodesics. 

Conversely, (2) requires more attention, since now the difference U™ — [/™ _1 
is no more defined. One possible way to circumvent this difficulty is to rewrite (2) 
in a variational form, which is already suggested in the Euclidean case. We simply 
notice that among the solutions of (2) we can select the minima of the functionals 

v ^ K(V) ■■= ^d 2 (V, ur l ) + 4>(V), (8a) 

i.e. UJf, n — 1, • • • , TV, solve recursively the family of minimum problems 

K(U?)=mi n*?(V) n = l,-..,N. (8b) 

(8a, b) still make sense in a Riemannian manifold, and they can be used to approx- 
imate (1). Of course, we expect the error estimates to be affected by the curvature 
of 5?\ nevertheless, it is interesting to note that, at least in the case of a simply 
connected manifold with nonpositive sectional curvature, we can reproduce the 
same result of the Euclidean framework. It is one of the most interesting contri- 
bution of [11], [2] to show that, as in the Euclidean case, these estimates do not 
directly rely on the finite dimension or on the local differentiable structure of the 
manifold, but on simple geometric properties of its Riemannian distance. 

d) (y ) ,d) is a metric space. In this case is no more clear what is the meaning 
of (1), whereas the recursive variational scheme (8a, b) still applies and provides 
a family of (hopefully) approximate solutions to (1). This fact has led several 
authors to use this approximation procedure to define a solution of (1), when 
the differential problem does not exhibit a useful linear structure. In an abstract 
metric framework the contributions of E. De Giorgi and his collaborators [7], 
[6], [1], show how to give a meaning to (1) and provide general conditions for the 
convergence (up to the extraction of a subsequence) of the variational scheme with 
uniform step sizes. 

Before discussing the delicate question of convergence estimates in this ab- 
stract setting, let us briefly recall a particular example, which was also one of the 
main motivation of the present investigation: it arises from the recent papers of 
F. Otto [9, 16] on a new variational interpretation of the Fokker-Planck and the 
porous medium equations as gradient flows w.r.t. suitable functionals in the space 
of probability measures metricized by the so called Wasserstein distance. For the 
sake of simplicity, here we will consider only measures which are absolutely contin- 
uous w.r.t. the Lebesgue measure, and we will identify them with their densities. 
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An example of the metric setting: the porous medium equation and the Wasser- 
stein distance 



Let us introduce the set 5? = ^ 2 (R k ) 



^ 2 (R k ) := iu G L 1 (M fc ) : u > 0, f u(x) dx — 1, f \x\ 2 u(x) dx < +oo j, (9a) 

^ JR k J R k ' 

endowed with the Wasserstein distance (we refer to the recent book [17] for a 
comprehensive and up-to-date overview of this and related topics) 



W 2 (v, w) := min l / \r(x) — x\ 2 v(x) dx : 

1 JR k 

/ C(y)'w(y)dy= / ((r(x))v(x)dx 

JR k J R k 



r : R k —> R k is a Borel map, 

. (9b) 

for all functions ( G C^(R /e )|. 



The map r attaining the minimum in (9b) is usually called the optimal transporta- 
tion between v and w. For m > 1, we also introduce the functional on £? 2 (R k ) 



cp(u) := — [ u rn {x)dx. 
m — 1 J Rk 



(9c) 



In [16] F. Otto showed that the nonlinear ( porous medium) diffusion equation 



d t u — A (u m ) = 0 
u(x , 0) = u°(x) 



in R k x (0, T), 
in R fc , u° G t 



(9d) 



can be considered as the gradient flow of the functional ip in ^^(R fc ) w.r.t. the 
distance W: here, the meaning of “ gradient flow ” has to be understood exactly 
as the limit of the discrete solutions obtained by the recursive minimization 
algorithm (8a, b) with the choice d := W and <fi := ip corresponding to (9a,b,9c). 

Even if &%(R k ) is a convex subset of L 1 (R fe ), segments do not provide the 
shortest path (w.r.t. W) connecting two elements, since they do not have even 
finite length. However, it is worth mentioning that the functional p defined by 
(9c) is still convex along the (constant speed) geodesics of ^(R fc ), as showed by 
[ 12 ]- 



Geodesic convexity is not enough for error estimates in metric spaces 

Let us recall that the well-posedness (and therefore also the problem to find op- 
timal error estimates) for gradient flows of convex functionals in general Banach 
spaces (even finite dimensional) is still completely open. This remark points out, at 
least heuristically, that severe restrictions should be imposed to the general metric 
setting, and that the geometric properties of the distance should play a crucial 
role. 

A first step in this direction has been obtained by U. Mayer [11], who 
considered gradient flows of geodesically convex functionals on nonpositively curved 
metric spaces : these are length spaces where the distance from any given point 
w G 5? satisfies the inequality 

d 2 (v t ,w) < (1 — t)d 2 (vo,w) + td 2 (vi,w) - t( 1 - t)d 2 (vo,v i) t G [0, 1], (10) 
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along any constant speed geodesic curve {^}te[o,i] connecting two arbitrary points 
vo,vi £ y. This property, which was introduced by Aleksandrov on the basis of 
the analogous inequality satisfied in Euclidean spaces (in fact an equality) and in 
Riemannian manifolds of nonpositive sectional curvature [10, §2.3], allows to prove 
(7), and to obtain the convergence of the approximation scheme (and a suboptimal 
a priori error estimate) by following the same ideas of the celebrated Crandall- 
Liggett generation Theorem [5]. It is interesting to note that if d is induced by a 
norm in a vector space and it satisfies (10), then the norm obeys the parallelogram 
rule (simply choose t = 1/2 in (10)) and therefore it is induced by a scalar product. 

These assumptions, however, do not cover the case of the Wasserstein distance 
(9a, b). In fact, it is possible to prove (cf. [2]) that the distance of this space satisfies 
the opposite inequality of (10), thus providing a positively curved space, as formally 
suggested also by [16]. 

A new geometric condition 

The main idea to overcome this difficulty and to derive better error estimates is 
to allow more flexibility in the choice of the connecting curves, which do not need 
to be geodesics but could depend on the distance and on the functional. This 
approach relies on a careful analysis of the techniques of [14] and it is completely 
developed in an intrinsic metric framework in [2] , where a particular attention to 
the case of the Wasserstein distance and its applications is devoted. It is surprising 
that all the estimates are based only on the following geometrical property: 

for every triple w,v o,iq £ y there exists a curve {vt}te[o,i] connecting vq to v\ 

which satisfies 

4>(v t ) < (1 - t)(j)(v 0 ) + t<j>{v i) t £ [0, 1], (11a) 

d 2 (w , Vt) < (1 — t)d 2 (w , vo) 4- td 2 (w , v\) — t( 1 — t)d 2 (v o, iq). (lib) 

In the next section, following the ideas of [2], we will present a new elementary 
derivation of the estimates of [14] in the easiest finite dimensional framework (thus 
avoiding any technical tools related to the theory of maximal monotone operators) , 
trying to use only the metric properties of the Euclidean spaces related to (lla,b). 
We shall obtain estimates independent of the dimension and of the vectorial struc- 
ture, which therefore will be immediately applicable to general Hilbert and metric 
spaces. An application to the porous medium equation will be briefly sketched at 
the end of the paper. 



2. Main result 

In this section we shall deal with the simplest situtation, which is provided by a 
convex C 1 functional cj> defined in a finite dimensional space y := R m , 
endowed with the usual euclidean distance d(u^v) := \u — v\. 
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Since the elementary proof of this result will be based only on the fact that 
for each w,v o, tq G 5? 

the segment v t := (1 — t)v o + tv\, t G [0, 1], satisfies (lla,b), 
we will keep the notation d(u, v) instead of \u — v\ for the distance. 



Theorem 2.1 (Optimal error estimate). Let u be the solution of (1), let U T be the 
discrete solution obtained by solving (2) or, equivalently, (8a, b). Then 

N 

E 2 t = ma xd 2 (U T (t),u(t)) < d 2 (u°,Ul ) + (12) 

t^i'P t 

n= 1 

where is the a posteriori error estimator defined by 

ff . n ;= <t>(Ul~ l ) - 4>m) _ <P(U?~\U?) > 0 _ (13) 

T ' T n rl 

Moreover, if 4> is bounded from below, then 

Y, T l S r ^ l T l - i« f <t>) ■ ( 14 ) 

n= 1 

Finally, the following optimal a priori bound in terms of the gradient of <fi holds 



Y T n^r<M\^( U r)\ 2 - 



We will divide the proof in some steps. 

A “metric variational inequality” for u. Our first remark is that (5) can also be 
written as 



^\d 2 (u(t),v) + (f>(u(t)) < 4>(v) VvGy, £g[0,T]. (16) 

Observe that, being u regular, (16) contains all the information of (1); in particular 
it is easy to derive from (16) the contraction property for continuous trajectories 
(7). In fact, if w is another solution of (16), which we rewrite by introducing 
another variable s 



£td 2 (w(s),v) + (j ) (w(s))<4>(v) Vvey, te [0,T], (17) 

then, choosing v := w(s) in (16), v := u(t) in (17), and summing up the two 
inequalities, we find 

mld 2 (u(t),w(s)) + -§^±d 2 (u(t), w (s)) < 0 s,te[0,T}. (18) 

Evaluating this relation for s = t we obtain 

j^^d 2 {u(t),w{t)) < 0 i.e. d(u(t),w(t)) < d(u°,w°) t e [0,T]. (19) 
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A “metric variational inequality ” for U™. The second step consists in writing an 
analogous variational inequality for the discrete solution: here, we will use (11a) 
and (lib) in a crucial way. In fact, choosing w := t/™ -1 , it is easy to see that they 
yield the following strong convexity property for the functionals $£(•): 



$>t) < (1 - t)*Z(v o) + t^(v i) - 1 - t)d 2 (v 0 ,v 1 ). (20) 

Starting from the minimum property (8b) and applying (20) with vq := fZJ?, v\ := 
V, we get 

$«([/») < < (i _ t)$i(u?) + t$xv) - 2 ^(i - t)d 2 (u?, v) vi^ey. 

The minimum condition says that the right derivative at t = 0 of the right hand 
side is nonnegative; thus, we find 

$”(V) -$?(£/■”) -^d 2 (U?,V)>0 VVey, (21) 

which can also be written as 

±(±#(u?,v) - \d 2 {U?-\V)) + ^(U^ur 1 ) + <t>{U?) < <KV). (22) 

A continuous formulation of (22). We want to write (22) as a true differential 
evolution inequality for the discrete solution U T , in order to compare it with the 
continuous one (16) and to try to reproduce the same comparison argument which 
we used for (18) and (19). A first idea, which was exploited in [14], is to use a linear 
interpolation between the consecutive values of the discrete solution. But an even 
simpler choice is possible (as in [13]): we interpolate the values of the functions 
of UJf instead of interpolating the arguments, since this does not even require an 
underlying linear structure. Therefore, we set 

4> t (s) := “the linear interpolant of </>(£/™ -1 ) and </>([/™)” if s G (£™ _1 ,£™], (23) 



t n — o q _ +n - 1 

Ms) ■■= - — Hur 1 ) + —<t>iK) s e ( t n T -\t n T \ . 

Tn, Tn 



Analogously, for any V G 5? we set 



C-s 



s-t a 



d 2 (s;V) := -z d 2 (U?~\V) + ^d 2 (U?,V) s e (t n T ~ \t n T \. 

Tn Tn 



(24) 



(25) 



Since 



A 

ds L 



-d 2 T (s;V) = J-(d 2 (U?,V) -d 2 (U?-\V)) s G {t n y\t n T i 
(22) becomes 

£K ( fl ; v ) + < 4>{v) + i a T ( s ) v* G (o,r) \ r T , 



( 26 ) 
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where we set, for s G (t? \f£], t? := \{t? + t ? 1 ), and 

^.( S ) : = <j> T {s)-HU?) - 



V 



£f 2 (t/”, C/”- 1 ) 






(27) 



The comparison argument. Taking a convex combination w.r.t. the variable s £ I™ 
of (16) written for v := U™~ x and v := U™, we easily get 

$- t \d 2 T {s,u(t)) + 4>{u{t)) < <p r (s) Me[0,T], (28) 

whereas choosing V := u(t ) in (26) we find 

&±4(s;u(*)) + &.(«) < </>(u{t)) + \@ T (s) t G [0,T], s G [0 ,T]\V r . (29) 

Summing up (28) and (29), we end up with 

^4( S > U W) + §- s d 2 T {s\u(t)) < @ T {s) t G [0,T], s G [0,T] \V T . (30) 
Choosing s = t, we eventually find 



f t d 2 r {t,u(t)) < @ T (t) t £ [0, T] \V T , 
and therefore, being t > d T (t,u(t)) continuous, 

d 2 T (T,u(T)) < 4(0, «°) + f @ T {t)dt. 

Jo 

Since d T (t,V) = d(U T (t),V) if t £ V T , a simple evaluation through (27) of the 
contribution of each subinterval to the integral gives 



(31) 



(32) 



N 



d 2 (U T (T),u(T)) < d 2 (U?u°) + 5>n4\ 



(33) 



Cancellation effects and rate of convergence. Choosing V := U™ 1 in (22), we find 

Hj\u?, ur 1 ) + hk) < hu?- 1 )-, ( 34 ) 

this shows that > 0 and that n </>(U™) is decreasing; we get 

N N 

X>X* < - HU?)) < |r|(^(4) - HU?)) (35) 



71=1 



n — 1 



which proves (14). (15) requires a more refined argument. We start by writing 
(22) at the time step n — 1 with the choice V := U™ 

wr 1 ) - hu?) 

(36) 



< 



2^77 (d 2 (U?- 2 , U?) - d 2 (U?~\U?) - d 2 (U?-\U?- 2 )) . 
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Now, we observe that the triangle and the Young inequalities yield 

d 2 (u?- 2 ,u?) < + 

< (l + ^zi)d 2 (U™- 2 , U "- 1 ) + (1 + iz=±)dP(U?-\u?). 

By substituting this inequality into (36), we obtain, after dividing by r n , 

<t>m ~ L ) - <t>(u?) < d 2 (u?-\u?) d 2 (ur\ur 2 ) 

T n ~ 2 Tl 2 t 2 _j 

If we insert (38) into the expression of we get 

^ d 2 (u?-\ur 2 ) d\ur\u?) 

- 2 r 2 2 r 2 

i ZT n-l ZT n 

and therefore, by (13) and the positivity of we eventually find 



N 



N 



^r n 2 C<M 2 ^C<|T 



n = 1 



n=l 



V n 



2r 2 



(37) 



(38) 



(39) 



)• (40) 



The modulus of the gradient of a convex function. For convex functions the mod- 
ulus of the gradient can also be characterized as 

|V0(w)| = SU P | ■ V G ± w,4>{v) < (£(w)j; (41) 

thus, since < </>(£/£), we get 

mhmii _ ts&iSi < _ emz > < i lw(t oi 2 . 



General metric space and the application to the porous medium equation 

It should be clear from the above proof that 

Corollary 2.2. The statement of Theorem 2.1 holds even if {^,d) is a metric 
space, (f) is a l.s.c. (extended) real functional defined on 5? whose gradient norm 
is defined by (41), u is a solution of the “metric evolution inequality ” (20), U T 
solves the variational algorithm (8a, b), and (lla,b) hold. 

If we want to apply the previous result to the example of (9a,b,c,d), for a 
triple w,vo,vi G R k ), we should exhibit a connecting curve v t , t G [0,1], 
which satisfies (11a, b). Let us denote by ro,ri the optimal transportation maps 
between w and vq, v\ respectively (cf. (9b)); we define the curve v t by duality with 
continuous functions 

f C (y)vt(y) dy '■= [ C((l - t)r 0 (x) + tri(x))w(x) dx VCeC°(M fc ), (42) 

JR k JR k 

and it is possible to prove [2] that (lla,b) are satisfied. Thus we obtain 
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Corollary 2.3. Let u be the solution of the porous medium equation (9d) and let 
U T be the discrete solution obtained by solving the recursive minimization scheme 
(8a, b) with the choices of d,<p corresponding to (9b, c). Then the a posteriori esti- 
mates of Theorem 2.1 hold; in particular, if U® = u° 

sup W 2 (u(t), U T (t)) < \t\ — [ (w°(x)) m dx, (43) 

teV-r m — 1 J R k 



and, ifu m x ! 2 G i? 1 (R fc ), 



sup W 2 (u(t),U T (t)) < 

tev-r 



m 



2 ( ra - 1 / 2) 2 



[ V[(tx 0 (x)) m - 1 / 2 ] 

J R k 



2 

dx. 



(44) 
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A Multi-mesh Finite Element Method 
for 3D Phase Field Simulations 

Alfred Schmidt 



Abstract. In phase field models, different components of the solution (temper- 
ature and phase variable) exhibit a strongly different local behaviour. Espe- 
cially in 3D, the high resolution needed for the phase variable in the transition 
region calls for specially adapted methods for discretization in order to make 
computations with as good a resolution as possible. An optimal discretization 
should use a separate mesh for each component. We present a general frame- 
work for the adaptive solution of coupled systems and its application to phase 
field simulations, making 3D simulations possible even on desktop computers. 



Free boundary problems give rise to coupled systems of partial differential 
equations, which are prominent examples of systems where a careful numerical 
discretization is needed in order to resolve the solution behaviour. Usually, a high 
resolution is essential near the free boundary, while coarser meshes are sufficient 
in the bulk. A reasonable numerical method should use locally adapted meshes to 
fulfil both the needs of accuracy and efficiency. Especially for 3D problems, accu- 
rate simulations may be impossible without appropriate discretization concepts. 
For coupled systems like the phase field system, where the components of the solu- 
tion show strongly different local behaviour, an optimal discretization should use 
specially adapted meshes for each component. 



1. Models for solidification 

An undercooling or oversaturation of a liquid leads to a rapid solidification of 
the material. Models for this behaviour include energy and/or mass transport 
by diffusion and/or convection as well as energy/mass conservation conditions 
across the interface, like the Stefan condition. On a certain (meso-)scale, additional 
surface effects at the phase boundary play an important role, depending on the 
curvature Cr or the velocity Vr of the moving interface T(t), usually described by 
a Gibbs-Thomson relation like 

£ c Cp £yVr H - 0 = 0 on r (t). 

Anisotropic surface effects, where the ‘curvature’ is the variation of an anisotropic 
surface energy which, as well as £ c and e v , depends on the direction of the surface 
normal, lead to dendritic growth. 
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Mathematical models for solidification with surface effects differ mainly in 
the treatment of the free boundary, defining it as a sharp interface, a level set, or 
a diffuse interface (phase field). Numerical methods for solidification simulations 
are developed following the same lines. 

1.1. Sharp interface models 

In a sharp interface model, the phase boundary is a (smooth) hyper-surface of 
the underlying space (a surface in 3D or a curve in 2D). The advantage of the 
treatment of the interface as a smooth surface is its lower dimension and that 
curvature, which appears in the Gibbs-Thomson law, is well defined. Drawbacks 
are that the standard model allows no changes in interface topology, no nucleation. 
The notion of varifolds allows for some possibilities in this direction. 

The free boundary problem leads to a degenerate parabolic equation for the 
interface motion, like an anisotropic mean curvature flow equation, coupled with 
heat or mass diffusion in solid and liquid phases. Additionally, convection can be 
added by coupling to Navier-Stokes’ equations in the time dependent liquid phase. 

Numerical methods for sharp interface simulations. The sharp interface 
model leads to numerical methods with separate discretizations of bulk (3D) and 
interface (2D) (or 2D & ID). 

In the bulk, a fine mesh is essential near the interface to sufficiently resolve the 
behaviour of solutions (temperature, concentration). Adaptively refined meshes in 
finite element methods are crucial to get enough resolution, especially in 3D. A 
numerical sharp interface finite element method with adaptive meshing in 2D and 
3D was presented in [7, 8], with additional convection in 2D in [2]. 

Especially in 3D, the discretization of the moving interface, which substan- 
tially changes shape during the growth process, is a delicate task, which makes 
repeated remeshing or at least mesh modifications necessary. 

1.2. Diffuse interface models 

Phase field models can be formulated either with a double obstacle potential or 
a smooth double well potential. We want to restrict ourself here to an obstacle 
formulation, as introduced by Blowey and Elliott [3]. 

The model describes the evolution of temperature 8 and phase variable x in 
a domain Q and it is a system of two (degenerate) parabolic equations: 

dt{0 + \x)-*M = f 
edtX ~ ediv(a(Vx)) + A(x) - j/?X 9 7# 

in Q x (0, T) plus boundary and initial conditions. The function a may include 
anisotropic solidification parameters, and A is a (set valued) maximal monotone 
graph, the subdifferential of the double obstacle potential, 

(—oo, 0] if s = —1 

0 if s £ (—1, 1) 

[0, Too) if s — Tl 



A(s) = 
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with the effect that values of x are i n the interval [—1, +1]. Further, / is a given 
heat source density, A, ft, and (3 are non-negative, material-dependent coefficients, 
and e > 0 a small parameter. The phase variable x ls equal to -1 (solid) or +1 
(liquid) everywhere but in a narrow transition region of width 0(e). For e — ► 0, the 
solution converges to a solution of the sharp interface model with a corresponding 
Gibbs-Thomson relation. Figure 1 shows the evolution of the phase boundary 
during the solidification of an undercooled liquid from a 2D simulation. 




Figure 1. Solid-liquid interface at different times 

At a first glance, the introduction of a phase variable makes mathematical 
analysis of long-time solutions and numerical methods much easier. The sharp 
moving interface is approximated by the level set of a smooth function, thus even 
allowing geometrical ‘catastrophes’ like topological changes of the interface, nucle- 
ation, etc., without big problems in the mathematical formulation (by introducing 
the notion of viscosity solutions, e.g.). 

Both components of the solution, the temperature and the phase variable, 
show a strongly different local behaviour, see Figure 2. 

• The phase variable is constant outside of a moving narrow strip of width 
O(e), where the phase transition occurs. Here, |V%| = 0(e _1 ). 

• The temperature satisfies the heat equation outside this strip, thus it 
is smooth. Inside the moving strip, the gradient of temperature changes 
rapidly. In the limit £ — > 0, the Stefan condition holds at the sharp inter- 
face, 

(ft^^liquid ^^^solid) ' ^ — AV. 

In order to resolve this behaviour with a finite element approximation, different 
requirements to the discretization hold for both components. 

• The mesh for the discretization of x must have a local mesh width such 
that hs < ce. Outside this strip, where x is constant, the mesh might be 
arbitrarily coarse. In order to be able to track nucleations, the phase field 
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Figure 2. Graphs of phase variable and temperature on 1/8 domain 



system should be solved in the whole domain, not only near the current 
interface. 

• To resolve the temperature behaviour, a much coarser mesh is sufficient 
in the strip than the mesh needed for the phase variable. On the other 
hand, the mesh must have a sufficient fineness also in the remainder of the 
domain. 

A separate discretization for both temperature and phase variable is needed in 
order to meet all these requirements in an efficient numerical method. 



2. Adaptive finite element methods for coupled systems of PDE 

Besides phase transitions, many physical problems lead to coupled systems of 
partial differential equations, too. Especially in case of nonlinear phenomena, the 
components Ui , i — 1 , . . . , n may show a strongly different behaviour (smoothness 
of solutions etc.) in the common underlying domain Q cR d . 

The usual adaptive discretization for systems of PDEs uses the same mesh 
for all components, locally refined on the basis of a posteriori error indicators for 
the sum of error contributions on each mesh element. 

An optimal adaptive discretization should use different locally refined meshes 
for the different components of the solution, controlled by separate error indicators 
for each component. 

In [6] we present a general concept for adaptive finite element methods for 
stationary or time dependent coupled problems. 

The solution components Ui are discretized in finite element spaces X { , which 
are based on different locally refined simplicial grids Si, where all the meshes Si 
are refinements of the same macro triangulation So of SI, see Figure 3 for a simple 
example. 
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Figure 3. Two different refinements of a macro triangulation <Sq 



The advantages of this approach are 

• separately adapted mesh for each component of the solution; 

• altogether fewer degrees of freedom, thus more efficient; 

• by the common macro mesh, a direct local hierarchy of the triangulations 
Si is given. This allows for an exact evaluation of each other component 
Uj on the elements of mesh Si. 

A somewhat similar idea for phase field simulations was proposed by Elliott 
and Gardiner [5] , who used two separate meshes for the temperature and the phase 
variable. A fine mesh with mesh size /i«£ for the phase variable is used only near 
the transition region (selected by a ‘mask’), a coarse mesh with mesh size H = Ah 
is used for the temperature. Use of the ‘mask’ reduces computations for the phase 
variable significantly, but it does not allow to track nucleations, e. g. Both meshes 
are aligned, but no error estimators or local mesh refinements are used. 

2.1. Aspects of implementation 

The adaptive multi-mesh method is implemented in the finite element toolbox 
ALBERT, a joint development with K. G. Siebert [9, 10]. The toolbox uses simpli- 
cial meshes in 2D (triangles) and 3D (tetrahedra) and local refinement by bisection 
of elements, which induces a hierarchical structure of the meshes and the finite 
element spaces. 

The multi-mesh concept introduces the traversal of a common, virtually re- 
fined mesh <Sm, see Figure 4, for the calculation of coupling terms, which involve 
several components or functions from finite element spaces defined on different 
meshes, like Ui$ with Ui G $ E Jj. 




Figure 4. Virtual mesh Sm with locally maximal refinement 
from S \ , S 2 
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2.2. Numerical analysis and adaptive methods 

We recall the standard a posteriori error estimates and adaptive finite element 
methods for a scalar elliptic problem. The error \\u — U\\ between the solution u 
and a discrete approximation U is estimated by a sum of local error indicators, 

e.g. 

\W-Uf < 

ses 

where the indicators rjs must be computable from the discrete solution U and the 
given data of the problem ( a posteriori ) . 

On an quasi-optimal mesh for a given error tolerance tol , the local error 
indicators are equally distributed over all mesh elements, 



Vs 



tol 2 

#5 



for all S G 5, 



compare [1], e.g. A similar approach is used in the case of coupled systems. The 
goal is an estimate of the errors || U{ — Ui\\ by a sum of local error indicators on 
separate meshes, like 



y] IK - Uif < Vl ,S + • • ' + ^ ^n,S- 

i = 1 S(zSi S€.S n 

In order to describe a quasi-optimal set of meshes, one possibility is to split the 
total tolerance tol to the separate meshes 

toll •+-•••+ tol 2 n — tol 2 



and optimize every single mesh. This leads to the decoupled equidistribution con- 
ditions 



vis 



toft 

-—jr for all S G Si, i = 1, . . . , n. 
rr^i 



3. Adaptive method for phase field models 

In a joint paper with Z. Chen and R.H. Nochetto [4], we derive error estimates 
and adaptive methods for the double obstacle phase field system. Denoting by 
u = 6 + Ay the energy density, and again the discrete (finite element) functions by 
uppercase letters, the estimate looks like 



■ “ + V^||X “ ^1Iloo(0,T;L 2 (Q)) 



+ (j l|0 ~ ®lli 2 (c7) + a£ II Vx - VAII^ 



1/2 



dt 



< computable terms, localizable to mesh elements. 




Multi-mesh FEM for 3D Phase Field Simulations 



299 



In [4], we use a common mesh for the discretization of the temperature and the 
phase variable. The local error indicators rjs contain contributions from both com- 
ponents, 

Estimated error < rjo + max r \ | , 

ses m 

where m is the time step index. A quasi-optimal common mesh fulfils the equidis- 
tribution condition for the local indicators 



Vs 



tol 2 
#<S m 



for all S € <S m 



Here, the total mesh element count #<S m is very large because of the fine resolution 
in the strip, thus the local temperature error must be very small also in elements 
far from the interface. 



Adaptive FE method with separate meshes 

A separation of contributions from temperature and phase variable in the error 
indicator is possible: 

Estimated error < rjo + max ( X r ie,s+ X »&s)- 

ses™ ses ™ 

A split of the tolerance to both meshes, as described above, leads to a smaller 
element count #<S™, thus to larger local error tolerances for the temperature error. 
It follows that the temperature mesh contains considerably fewer elements than a 
common mesh. Also the mesh for the phase variable contains less elements, as a 
fine resolution is needed only in the transition region. 

Finally, we show a few numerical results. Figure 6 presents pictures from a 
3D simulation where the phase boundary is a growing sphere. Figure 5 depicts the 
corresponding numbers of mesh elements over time. As expected, the temperature 
mesh has only few elements whose number grows slowly, while the phase variable 
mesh is much finer and grows nearly linearly in time in order to resolve the growing 
free boundary. 

With our method, we expect to be able to efficiently and accurately simulate 
three dimensional free boundary problems. 
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Morse Description and Geometric Encoding of 
Digital Elevation Maps 

Andres Sole, Vicent Caselles, Guillermo Sapiro, and Francisco 
Arandiga 



Abstract. Two complementary geometric structures for the topographic rep- 
resentation of an image are developed in this work. The first one computes a 
description of the Morse structure of the image, while the second one computes 
a simplified version of its drainage structure. The topographic significance of 
the Morse and drainage structures of Digital Elevation Maps (DEM) suggests 
that they can been used as the basis of an efficient encoding scheme. As an 
application we combine this geometric representation with an interpolation 
algorithm and lossless data compression schemes to develop a compression 
scheme for DEM. This algorithm permits to obtain compression results while 
controlling the maximum error in the decoded elevation map, a property that 
is necessary for the majority of applications dealing with DEM. We present 
the underlying theory and compression results for standard DEM data. 



1. Introduction 

The use of a topographic description of images, surfaces or 3 D data has been in- 
troduced and motivated in different areas of research: image processing, computer 
graphics and geographic information systems (GIS) [BPS97, CK98, FS96, KK94, 
SKK91]. In this paper we propose a geometric approach to compress Digital Ele- 
vation Models (DEM). DEM data are a digital representation of a surface terrain 
where each cell corresponds to a point ( x,y,z ) in 3D space, (x, y) being the image 
coordinates and z its height. Obviously, this kind of data requires a large amount 
of bytes to be stored, thus some compression is needed. In the case of DEM data 
it seems reasonable to store only those geometric structures which are of special 
relevance (such as its Morse and drainage structures) and interpolate the rest from 
this non linear subsampling. The object of this paper is to compute these struc- 
tures and to use them to efficiently compress DEM data, while controlling the 
norm of the error. 



Key words and phrases. Image compression, digital elevation maps, Morse structure, drainage 
structure, geometric encoding, geometric sampling. 
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This paper is structured as follows. Section 2 is devoted to the computation 
of the Morse structure of an image. Some basic notions, like the notion of mono- 
tone section of a topographic map, are introduced, and an algorithm to compute 
the Morse structure is proposed. We also briefly comment on the mathematical 
justification of the proposed algorithms. In Section 3 we describe an algorithm 
to compute a simplified drainage structure. Section 4 describes the coding and 
interpolation strategy and we summarize the compression algorithm used in our 
experiments. Section 5 shows comparative results between the proposed algorithm 
and the standards JPEG Lossless (JLS) method and JPEG-2000. Finally, in Sec- 
tion 6 we present the main conclusions of this work. 



2. Morse theory and monotone sections 

The aim of Morse theory is to describe the topological changes of the (iso) level 
sets of a function in terms of its critical points. We will consider here the case 
of 2D real valued data (gray level images, or height data) and we will describe a 
simple and efficient algorithm to compute its Morse structure from its upper and 
lower level sets. 

To make the ideas precise, let us first recall some definitions. Let D be a subset 
of M N . Given a function u : D 1R, we call upper (lower) level set of u any set 
of the form [u > A] := {x E D : u(x) > A} or [u > A] := {x E D : u(x) > A} 
([u < A] := {x E D : u(x) < A} or [u < A] := {x E D : u(x) < A}), where 
A E JR. The (upper) topographic map of a function u is the family of the connected 
components of the level sets of w, [w > A], A G JR, the connected components 
being understood in the relative topology of D. We shall assume that our image 
is a continuous function and each upper or lower level set has a finite number of 
connected components which is true for a discrete image or for a continuous image 
after some basic filtering. In [BCM01] the authors studied the Morse structure of 
the topographic map for continuous functions (a similar study can be done for 
bounded upper semicontinuous functions). They defined a notion of nonsingular 
region of the topographic map trying to express the fact that the level lines of the 
topographic map in a nonsingular region do not change topology, as it happens 
for smooth functions where singularities are understood in the usual way. 

Let u : D — > JR be a function. For each A, /jl E JR, A < /i we define 
U\ = {x e D : A < u{pc) < fi}. 

The connected components of a set X C 1R N will be denoted by CC(X). If x E X, 
the connected component of X containing x will be denoted by cc(X, x) 

Definition 2.1. Let u : D — ► JR be a continuous function. A monotone section of the 
topographic map of u is a set of the form X\^ E CC(U\ ifJL ), for some A, fi E JR with 
A < //, such that for any A', fi' E [A, /i], A' < // the set {x E X\ ^ : A' < u(x) < fi'} 
is a connected component of Uy^. 




Morse Description and Geometric Encoding of Digital Elevation Maps 305 



Due to small oscillations in the image, its Morse structure is too complex, 
i.e., there are many criticalities. To simplify the structure of the topographic map 
while preserving its main features we filter the image with the Vicent-Serra filters. 
If u has been filtered with the extrema filters with an area threshold e and X is a 
connected component of [u > A] or [u < A], then \X\ > e. The resulting image has 
a simplified topographic map structure. Indeed, for any A < /i, there are a finite 
number of connected components of U\^. In particular, there are a finite number 
of connected components of [u > A] and [u < A]. In the sequel, we shall assume 
that 

(H) u has been filtered with the extrema filters. 

Assuming property (H), if X\ ^ is a monotone section, then the family of sets 
In [u — a], a E [A, jj], is a family of nested connected sets [CSS02] . Moreover, the 
union of monotone sections which intersect is a monotone section [CSS02]. This 
allows to define the notion of maximal monotone section containing a given point. 
Let xgD and A = u(x). For each 77 > A, let X\ ^ = cc(U\^,x). We define 

77+ (x, A) = sup{?7 : 77 > A s. t. X\ is a monotone section}. 

Similarly, we define rj- (x, A) = inf {77 : 77 < A s. t. is a monotone section}. 
Note that both numbers are well defined since X\,\ is always a monotone section. 
Note that, by definition, 77. (x, A) < 77+ (x, A). The (open, closed, half open, or 
half closed) interval J(x, A) containing A whose end-points are 77- (x, A), 77+ (x, A) 
determines a monotone section containing x maximal with respect to inclusion, 
which we denote by Xj^ x ^)- 

Maximal monotone sections represent the largest sections of the topographic 
map containing no topological changes and our purpose is to compute them. Intu- 
itively monotone sections are topologically equivalent to truncated cones and the 
maximal monotone ones are the largest truncated cones contained in the graph 
of the image. The interpolation algorithms described in Section 4 are able to re- 
interpolate these truncated cones from the curves bounding them. To compute the 
maximal monotone sections we require some additional definitions. 

Definition 2.2. Let M CO. We say that M is a zonal maximum (resp., minimum) 
of u at height A if M is a connected component of [u = A] and, for all e > 0, the 
set [A — e < u < A] (resp., [A < u < A + e]) is a neighborhood of M. 

Definition 2.3. We say that A E JR is a singular value of the topographic map of 
u if it corresponds to a zonal maximum, minimum, or it corresponds to a level 
where it begins or ends a maximal monotone section, i.e., there is a point x E Q 
such that 77+ (x, A) = A or rj- (x, A) = A. 

Definition 2.4. Let A E JR. Assume that (H) holds. The signature of the level 
set [u > A] (resp. [u < A]) consists of a finite family of points {pi : i = l,...,r} 
(resp. {qj : j = l,...,s}) such that each pi (resp. qj) is a point in a different 
connected component X Xjl (resp. X\j) of [u > A] (resp. [u < A]). The points Pi,qj 
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are selected so that 

u(Pi) = sup i/(x),and u(qj) = inf u(x). 

We denote the signature of [u > A] by sig([u > A]), the signature of [u < A] by 
sig([u < A]). We define sig([u > A], [u < A]) = sig([u > A]) U sig([u < A]). 

To stress the marker point in the notation, we write X x,Pi instead of X x,t . 
As it is proved in [CSS02], under assumption (H), given A £ iR, there is e > 0 
such that sig([u > g], [u < g]) is constant for all g G (A — e, A]. 

Definition 2.5. We say that A E M is a critical value for u if there is a sequence 
| A such that sig([u > g n \, [u < g n ]) ^ sig([u > A], [u < A]) for each n — 

1 , 2 ,.... 

Proposition 2.6. Let A £ M. Assume that (H) holds. Then X is a critical value of 
u if and only if X is a singular value of the topographic map of u. 

The following result is proved in [CSS02] and permits us to compute the 
singular values of the topographic map of u by computing the critical values of 
u. Moreover, at the discrete level, a topological change occurs when going from 
level A to level A — 1 if and only if either i) sig([u > A]) ^ sig([u > A — 1]), or ii) 
sig([u < A]) ^ sig([u < A — 1]). This is equivalent to saying that i) two connected 
components of [u > X] merged at level A — 1, a phenomenon which we call merging 
of upper connected components, or a new connected component of the upper level 
sets was born at level A — 1, or ii) two connected components of [u < X — 1] merged 
at level A, a phenomenon which we call merging of lower connected components, 
or a new connected component of the lower level sets was born at level A. We call 
the first type of criticality of upper type, while the second is called of lower type. 
For a detailed description of the algorithm we refer to the reader to [CSS02]. Many 
different approaches exist to compute the basic Morse structure of images or 3D 
data. We have presented a simple computational approach and have studied its 
mathematical properties in [CSS02] . Let us also note that the singularities of the 
tree of shapes of an image as introduced in [MonOO], [BCM01], coincide with the 
notion of critical value defined here [CSS02]. 



3. Drainage structures 

In the previous section we have developed an algorithm to compute the Morse 
structure of an image. This Morse structure consists on the maxima, minima, and 
the level lines where a topology change occurs (i.e., the boundaries of the maximal 
monotone sections). This can be considered as a reasonable global description of 
a DEM. There are also other structures which are of special interest due to its 
topographic significance in DEM data. These structures correspond mainly to the 
drainage structures (e.g., rivers and ravines). We will present an approach which 
is related to the one in [See89] . Strictly speaking, we do not compute the drainage 
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structures but a version of them which is adapted to our purposes. In a simplistic 
way we can think of the drainage structures as the set of points for which there 
exists at least one direction in which the flow of water is accumulated or repealed. 
We can write down this definition mathematically by considering the set of points 

{x : 3v G iR 2 , e > 0 s. t. u(x) < u(x + tv) Vt G (— £, e), 
or u(x) > u(x + tv) \/t G (—£, e)}. 

Observe that this set of points will contain in particular the maxima and minima 
of u. It has been shown in [CSS02] that this set of points is contained in an at most 
a countable family of curves. Intuitively, these curves contain the drainage struc- 
tures (ridges and valleys), giving also information about boundaries of plateaus for 
example. In the discrete case we only consider 4 different directions (values of v) 
corresponding to 4 different profiles in the image. Concretely, we search maxima 
and minima of the vertical, horizontal and diagonal profiles. In [See89] only two 
directions were used, namely the horizontal and vertical ones. The problem is that 
a large number of extrema can appear due to low oscillations, mainly due to noise. 
In order to solve this problem we have to choose the most significant extrema, in 
our case those which have maximum contrast. Finally, if any computed extremum 
lies in a flat zone (i.e., it is a zonal extremum), we shall replace it by the boundary 
of the flat zone. 

Figure 1 illustrates the whole geometric sampling process. From left to right 
and top to bottom we show the original DEM image, the level lines corresponding 
to its Morse structure, the curves corresponding to the extrema of the profiles, and 
the final sampling made by the union of the two previous structures. A thinning 
step has been performed in order to obtain one pixel width curves. 

4. The interpolation and coding step 

We have described two algorithms that compute important points and curves from 
an image, thereby providing the basic geometric description of DEM data. These 
algorithms can be considered as a non uniform geometric sampling of the image. 
The next step is to interpolate the missing data from our sampling. There exists 
several algorithms to interpolate data from curves and/or points. We have in par- 
ticular tested three of them: the Laplacian model ( min f || Vu\\ 2 dxdy), the AMLE 
model (mmlimp-^oo / || \7u\\ v dxdy), and the thin plate model (f (A (u)) 2 dxdy). We 
remark that, rigorously speaking, only the AMLE model can be used to interpo- 
late values specified on points [CMS98]. In spite of this, we shall also use the 
Laplacian since there are many curves in the data and we may think of points 
as small regions. In order to evaluate these interpolation schemes we have chosen 
as a measure of goodness the entropy of the residual between the original image 
and the interpolated one. This is a natural choice since we want to minimize the 
number of bits used to encode the errors between the interpolated and the original 
images. After several tests we have decided to use the Laplacian interpolation since 
it reported the minimum entropy of the residual. In order to control the maximum 
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Figure 1. From left to right and up to down: the original DEM 
image, computed level lines (white), computed ridge/valley struc- 
ture (white) and the whole image sampling (white). 



(sup) error we simply store/encode the quantized error information (that is why 
the entropy of the residual was a natural measure of goodness for the interpolant). 
See the Appendix for more details on these interpolation processes. 

Now, we need to consider how to encode both the initial curves (geometrically 
sampled data) and the residuals once a sup error e is specified. The geometry of 
the sampled curves and their gray levels are encoded separately. To encode the 
geometry we use a differential chain coding strategy. For the gray levels we use 
an ENO (Essentially Non Oscillatory) based encoding scheme [AD00] which also 
controls the sup error. Finally we compress both the geometry and the gray values 
of the curves using an arithmetic coder. Having these curves and the data on 
them, we can interpolate them by means of the Laplace equation to obtain the first 
estimate of the image. Finally, to control the maximum error, we need to store the 
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Figure 2. Left: Compression ratio of JLS vs the proposed 
method for a set of ten images compressed with e = 5. Right: 
Root mean square error ( RMSE ) for the two methods. 



residuals r. Encoding the residuals can be simply done by quantizing them using 



sign(r ) 



M 



e, and then coding the resulting r q with an arithmetic coder. 



The compression ratios using this approach were already satisfactory. We 
observed that the encoding of the geometry represented the main cost in bits. To 
further improve the encoding of the geometry we have adopted a simple multiscale 
approach. We compute and encode the curves and the residuals in a subsampled 
image and then zoom out the result and recompute new residuals. If required, 
before sampling, we may filter the given image. In order to avoid the well known 
oscillation problem of classical splines when performing the zoom out step we have 
used a shape preserving spline which respects the monotonicity of the original data 
(no new maxima or minima are created). 



5. Compression results 

We report results on a set of 10 DEM images of size 1200 by 1200 pixels with 8 bits 
per pixel. We also report the results of our algorithm when applied to the same set 
of images with a resolution of 16 bits per pixel. In order to compare our results, 
we use the JPEG-LS standard for lossless and controlled lossy image compression, 
being this one the only standard that allows a control on the maximal per pixel 
error [WSS99], and we also use JPEG-2000 [AdaOl] in which, by reintroducing 
the errors, we are able to control the maximum error. In the following JPEG-LS 
is denoted by JL*S, while ours is denoted by ME, standing for morphological en- 
coding. We also report the RMSE as frequently done for lossy image compression 
algorithms. 

Figure 2 shows a more detailed study for the case e = 5. In the left plot we 
can see the compression ratio of both methods for the set of 10 images, the right 
plot corresponds to the RMSE for both methods. The average CR for the case of 
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Figure 3. Up: from left to right a region of the original DEM 
image baker- e and after compressing it with JLS , ME and 
JPEG — 2000 with e = 5. Bottom: level lines of the above images. 



JLS and ME are 18,9872 and 36,7245 respectively. That is, our proposed scheme 
ME reaches almost twice the compression ratio of JLS. In addition, the average 
RMSE (expressed in gray values, not meters) for the cases of JLS and ME is 
2,9038 and 2 , 3782 , respectively. 

We have also tested our algorithm using the full rank of the original images 
(16 bits) obtaining a compression ratio of 12, 6203 and a RMSE of 2.5935 meters 
on average with a maximum error e of 5 meters. 

We have compressed also the set of images at the same size as the one obtained 
with ME (for the 8 bit images only) using JPEG — 2000. Although JPEG — 2000 
does not control the sup error, it is possible to control it afterwards by storing 
some additional values in an auxiliary file. For this set of images this additional 
cost is in average of 584 bytes, which is negligible, since it does not modify the 
compression ratio in practice. In addition, the average RMSE is of 0.9326, which 
is much better than in the case of ME. 

Figure 3 displays the original image and its compressed versions using JLS , 
ME and JPEG — 2000. The figure shows both the gray scale images and its level 
sets. Note that the topographic structures are better preserved in the case of the 
ME and JPEG — 2000 compression, while they are severely distorted in the case 
of JLS compression. 



6. Conclusions 

In this work we have presented some techniques to compute a basic geometric 
representation of images. This representation is given by the Morse and drainage 
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structures of the image. As an application, this geometric image representation 
was used to derive a non uniform sampling strategy which, when combined with 
standard interpolation and coding techniques, provided a novel DEM compression 
algorithm. This algorithm produces compression ratios similar to JPEG — 2000 
and permits to control the maximal error in the decoded image, a property which 
is fundamental for most DEM applications. 



Appendix 

In this Appendix, we give a result which explains why the Laplacian and the AMLE 
interpolations are in some sense shape preserving. For the detailed proof we refer to 
[CSS02] . We shall consider an interpolation operator as a transformation E which 
associates to each open bounded set ft and each function p G C(<9ft) a function 
E(p,Q) G C(ft) such that E(p,ti)\dn — P- We shall say that the interpolation 
operator satisfies the stability principle if 

E(E(p,n)\ d n',n f ) = E{p,Q)\n' 

for any open bounded set £ 1, any p G <9ft, and any open bounded set ft' C ft. 
Suppose that the interpolation operator E satisfies the stability property, we say 
that E satisfies the maximum principle if 

inf p < inf E(p , O') < sup E(p, ft') < sup p (1) 

dW W Q7 dQ' 

for any open bounded set $2, any open bounded set ft' C ft, and any p G C(dSi'). 
If E satisfies the maximum principle and p = a in <9ft, where a e M, then 
E(p , ft) = a in ft, and the same is true in any open bounded set ft' C ft. 

Theorem 6.1. Let ft 2 be an open simply connected set. Let fti be an open set whose 
boundary <9fti is connected and fti C ft 2 . Let ft = ft 2 \fti- Assume that u\qq 1 = A, 
u\dn 2 — M w tth X < p or X > p. Let E be an interpolation operator satisfying the 
stability and the maximum principle. Then E(u\dn,Q.) contains only a monotone 
section in the sense that ft is a monotone section of E(u\qq,Q). In particular, if 
E(u\qq , ft) is the AMLE extension of the boundary data inside ft, then E(u\qq, ft) 
contains only a monotone section. 
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Behavior of a Rigid Body in an Incompressible 
Viscous Fluid Near a Boundary 

Victor N. Starovoitov 



Abstract. The paper deals with the problem of the motion of a rigid body 
in a viscous incompressible fluid. It is investigated under which conditions 
the body gets to the wall with zero velocity or does not touch it at all. An 
example is constructed which shows that the weak solutions of the problem 
admit collisions of the body with the boundary of the flow domain. In this 
case, the velocity field satisfies the energy estimate only. 



1. Introduction 

During the last decade, there have appeared a lot of papers where the problem of 
the motion of rigid bodies in a fluid was studied (see [6]-[3] and references therein). 
Most of them deal with the situation where the bodies do not collide with the 
boundary of the flow and with other bodies. The possibility of collisions is one of 
the main difficulties of the problem. Up to now, it is not entirely understood how 
to state the problem properly, because the description of the collisions requires 
generally some additional relations. However, there are situations that admit a 
global solution of the problem without any collision laws. Namely, in [6, 7, 8] the 
global solvability of the problem is proved under the condition that the boundaries 
of the bodies and of the flow domain are curves of class C 2 in the 2-D case and 
spheres in 3-D case. It is shown there that such a smoothness yields zero relative 
velocity of a body when it touches another one. This fact implies that impulse 
forces do not appear by the collisions. Note that there remain some questions 
related to the extension of the solution to the time after the collision. In particular, 
the solution can be extended in such a way that the body will be immovable after 
it touches the wall (see [3]). This question, as well as questions of solvability of 
the problem, will not be considered in this paper. Our aim here is to study the 
behavior of a rigid body in a fluid when it is touching the wall or it is getting to 
the wall. Of course, all the results can be easily reformulated in the case of the 
interaction of two bodies. 
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Let us recall the statement of the problem of the motion of a rigid body in a 
viscous incompressible fluid ( Problem A) as it was made in [6, 7, 9, 8]. The tensor 
D(u ), which is called deformation rate tensor if u is a velocity vector field, plays 
an essential role in our considerations. This tensor is the symmetric part of the 
gradient of u and its components read as follows 



Dij ( u ) 



1 / dm du i 



l -( 



2 \dxj 



+ 



dx 7 



)■ 



The kernel of this tensor consists of the rigid vector fields. Assume that S is a 
connected domain in R n and u : S —> M 71 is a vector field. Then D(u) = 0 in S if 
and only if there exist a vector a E M 71 and a skew-symmetric tensor Q E M n x M n 
such that 



u{x) = a + Qx for x E S. (1) 

The proof of this fact can be found, for instance, in [10]. Velocity fields of the 
form (1) correspond to rigid motions. In particular, if n = 3, then there exists a 
vector (jj such that Qx = u? x cc, where the sign x stands for the vector product, 
and the representation (1) has the form u(x) = a + x x. Thus, it is possible to 
specify rigid bodies by the condition that the deformation rate tensor vanishes in 
the domains corresponding to the bodies. 

Let ft be a domain in R n ( n = 2,3). We denote by Hp(Q), p E [l,oo], the 
space { u e (W 0 1,p (fl)) n : divu = 0 in ft}, and by iJ p (ft) the completion of i/p (ft) 
in the norm of L p (ft). Given an open subset S of ft, we introduce the following 
space 

K p (S,n) = {ueH^{Q) : D(u)= 0 in<S}. 

We omit the index p in the designation of the spaces if p = 2, i.e., i/fyft) = i/} (ft)? 
i/(ft) = #2(^)7 and K(S,SY) = K 2 (S, ft). The dual space i/ 1 (ft)* with respect 
to the pivot space i/(ft) will be denoted by i/ -1 (ft). The space K(S, ft) was 
introduced and investigated in [6, 7] (see also [9]). 

The motion of a rigid body can be described, for instance, by introducing a 
set-valued function S : [0,T] — > 2 fi , T E (0, 00). We assume that S(t) is an open 
subset of ft for all t E [0,T]. Since the body moves without deformation, there 
exists a family of preserving orientation isometries A4 s ,t : —> R n (s,t E [0,T]) 

such that S(t) = A4 s ^ t S(s). The characteristic function of S will be denoted also 
by S : 



S(Xj t) 



1, xeS(t), 

0, x E ft \ S(t). 



Finally, we denote by L q ( 0, T; K V {S , ft)) the space of functions u E L 9 (0,T; if* (ft)) 
such that E if p (5fy),ft) for almost all t E [0,T]. 



Definition 1.1. We say that a couple of functions 

u E L°°(0,T; i/(ft)) D L 2 (0 7 T; K(S 1 ft)), 
5 E C°’ 1/9 (0,T;L g (ft)), 1 < q < 00 , 
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and a family of preserving orientation isometries M s ,t • ® n (s,t G [0 , T]) 

form a weak solution of Problem A if S is a characteristic function, 



S(t) = M s ,tS(s) for all s,t G [0 , T], 
and the integral equalities 

/ (u • w t + (u <S> u — D(u)) : D(w) + g - w) dxdt = — u Q • wq dx , 

' ' Jn 




t + u • Vr/) dxdt 




dx 



( 2 ) 

( 3 ) 

( 4 ) 



hold for all functions w G W 1 ' 2 (£It) H L°°(0,T; K(S,Q)) and rj G C 1 (11t) such 
that w\ t =T = 0, T]\ t =T = 0. 

Here, = ft x [0,T], g is the external force vector field, /o = f\t=o with / 
standing for u , tr, 5, and rj. 



This definition deals with the case of one rigid body in a fluid. Besides, we 
have supposed, for simplicity, that the densities of the fluid and of the body are 
identically equal to 1. 

Theorem 1.2. Let 0 C E n , So C G H(Q), g G L 2 (0, T; and 

a) Q, and So are connected domains of the class C 2 if n — 2, 

b) Q, and <So are fraZZs if n = 3. 

T/ien for any arbitrary T G (0, oo) i/iere errzste a weak solution to Problem A. 
Moreover, the isometry M s ,t is Lips chitz- continuous with respect to s and t. 

This result was proved in [6] and [7] (see also [9] and [8]). 

In the next sections, we investigate the properties of the solution of Problem 
A in the situation when the body S is in a neighborhood of the boundary of the 
flow domain ft. The most interesting question is related to the contacts of the 
body with the boundary. The motion of the body during and before the contact 
is studied in Sections 2 and 3, respectively. Generally speaking, all the results 
below make sense if the contact is possible. Theorem 3.2 gives sufficient conditions 
which yield the impossibility of the collisions. On the other hand, an example 
constructed in Section 4 shows that velocity fields satisfying the energy estimate 
admit the collisions. 



2. Behavior of the body during the contact 

As it follows from Theorem 1.2, S(t) is a continuous function (for instance, in 
the Hausdorff metric). Suppose that S(t) D 9ft ^ 0 for all t from some time 
interval J C [0,T]. If the velocity vector field u is a function from the space 
L 2 (0, T; K(S, ft)), then u(t) G K(S(t),£l) for almost all t G J. Thus, in order to 
describe the motion of the body in this situation, we have to investigate functions 
from the space K(S, ft) (or K P (S , ft) in a more general case) with S D 9ft ^ 0. 
At first, we consider the case n — 2. 
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Theorem 2.1. Let fl, S C M 2 be connected domains of class C l,0i , a £ (0, 1], such 
that S C Q and dist(<S, dfi) =0. Ifue K P (S , fi) with p > then u(x) = 0 as 
x e S. 

Proof Assume that u is extended by zero to M 2 \ Cl. Let M be a point from the 
set S fl which is non-empty by assumption. 

Let us introduce a new orthogonal coordinate system (^ 1 ,^ 2 ) with the origin 
at M. The axis £1 is tangent to dCl and the axis £2 is directed towards the domain 
S. Since Q and S are domains of class C 1,a , there exist positive constants k and 
Po such that the sets 7r + = {£ £ M 2 : |£i| < po, &|£i| 1+a < £2 < k pj +c *} and 
tt~~ — {£ £ ]R 2 : |^i | < po, -/cpJ +Q! < £2 < —k |£i| 1+Q: } belong to S and R 2 \ Cl, 
respectively (see Fig. 1). 




Figure 1. Touching of the body S and the boundary dCl. The 
domains 7T+, 7 r“, C p and Q p . 



Since D(u) — 0 in S, there exists constants a £ M 2 and uj £ R such that 
u(x) = a + uj (x — xm) ± for x £ S. Here, Xm is the radius- vector of the point 
M and x - 1 is the vector with the components (# 2 , — x\). In the variables £, this 
representation can be written as u(£) = a -h cj ^ ± . 

Let us show that a — 0. Consider the rectangular domain C p = {£ £ M 2 : 
|£i | < p, -kp 1+a < £2 < kp 1+a }, p £ (0, po). We denote by £+ its upper side, 
i.e., £+ = {£ £ C p : £2 = kp 1+a }. For all p £ [1, 00 ], we have 

IMW+) < {2kp 1+a ) ip - 1)/p \\Vu\\ LP(Cp) . 

On the other hand, 



(J^\u(t)\*ds) 1/P = (/^ l a + w 

-Uc t la {PdS 

>\a\(2p)V*- 



^ds) 1/P 

) 1 /P -\co\(J^f d s) 1/P 

\uj\(2p) llp {kp l+a + (p+ 1) _ 



1/p 



P )' 
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Therefore, 

|a| < Cp p \\u\\ w i, P (jr p ) + Cp^ a + Cp, (5) 

where C is a constant. In the assertion of the theorem, we supposed that a E (0, 1] 
and p > As it follows from these conditions, (1 + a) (p — 1) — 1 >0. Since 
u E Hp(Q), the passage to the limit as p — ► 0 in (5) yields a = 0. 

Thus, u(£) = to ^ for £ E S. In order to prove the theorem, we have to 
establish that u = 0. Let us consider the domain Q p = {£ E R 2 : 0 < £i < 
p, < £2 < P £ (0, po). Since divu. = 0, we have 

I u-nds = 0, 

JdQ p 

where n is the outward normal to dQ p . The boundary dQ p of the domain Q p 
consists of three parts: 

do p = r+ u r; u r°, 

where = dQ p Pi 9 / 7r ± and r° = {£ : £1 = p, £2 G [-fcp 1+Q: , kp 1+a ]}. 

We supposed that u = 0 in R 2 \ ft. This condition implies 



u • nds = 0. 



It is not difficult to calculate that 



u-nds = - uo{p 2 + (kp l + a ) 2 ). 



Therefore, 



^ + (*P 1+a ) 2 ) = - / 

J r° 



u • nds = — 



Wl(/0,6)<^2, 



where -ui is the projection of the vector u on the axis £1. By integrating this 
equality with respect to p from 0 to some 7 E [0, po]> we obtain 



’ [ {P 2 + {kp 1+a ) 2 ) dp = — f «i(£) d£. 
Jo Jq ^ 



The integral in the left-hand side of the last equality can be estimated from below 
as follows 

On the other hand, due to the Holder and to the Poincare inequalities, 

f Mt)dt <|g 7 i (p - 1)/ i«ik P( ^)<cia 7 i^- 1 ^7 1+0 

JQ, 



ll^ll W' 1 .p(£ 7 )> 



where C is a constant. Since \Q 7 \ < /c7 2+a , we have 

\Lj\<C^ 2Qp - 2 - a) / P \\u\\ W ,, P(Cy) . 

By passing to the limit in this inequality as 7 — > 0, we find that u = 0. 
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Thus, u = 0 in S and the theorem is proved. □ 

In the three-dimensional case, a body touching the boundary can move. Its 
possible motion is a rotation around the axis which is orthogonal to the boundary 
at the point of touching. 

Theorem 2.2. Let fl,<S c R 3 be connected domains of class C 1,a , a E (0, 1], such 
that S C ft and dist(«S, dSY) = 0. If u £ K p (S,Sl) with p > then u(x) = 
u> x (x — xm) as x E S, where xm is the radius-vector of a point M e dS C l dPt 
and u> is a constant vector in R 3 which is orthogonal to dQ at the point M. In 
particular, if the set dS fl <9fl consists of more than one point, then u(x) = 0 as 
x E S. 



Proof The function u is assumed to be extended by zero to R 3 \ fl. Let M be a 
point from the set dS fl dfl. We introduce a new coordinate system £ = (£i, £2, £3) 
with the origin at the point M. The axes £1 and £2 are tangent to dfi, and the 
axis £3 is orthogonal to them. We shall also use the cylindrical coordinates (r, 6 , z) 
associated with £: £1 = rcosO, £2 = rsin#, £3 = z. 

As fl and S are domains of class C 1,a , there exist positive constants k and 
po such that the sets 7r + = {£ E R 3 : r < p 0 , kr l + a < z < k pj +c *} and 
n~ = {£ E R 3 : r < po, — fcr 1+a > z > —kpl+ a } are contained in S and 
R 3 \ 0, respectively. Since D(u) = 0 in <S, there exist vectors a and such that 
u(x) = a + cv x (x — x M ) for x E S. In the variables £, this representation reads 
as u(£) — a + u> x £. Exactly the same arguments as in the proof of Theorem 2.1 
imply that a — 0. Thus, t/(£) = iv x £. 





Figure 2. The domains 7r + , tt , Q p and Q 



p,cr- 



Let us investigate the vector u>. Consider the domains Q p = {£ E 



r E 



[0,p), 0 E [0, 27r], -A:r 1+a < 2 ; < kr 1+a } and Q p 
(see Fig. 2). Since divu = 0, we have 



{£ E Q p : <7 < 6 < a + 7r} 



/ 



u • n ds = 0, 



( 6 ) 
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where n is the outward normal to dQ p ?C7 . Let us divide dQ Pj(T into five parts: 

dG P ,a = u r- CT u u u e* 

where 

= {« € dQp, a : z = ±kr 1+a }, E° = {£ e : r = p}, 

K,* = it e ^ ® = ^}» = {€ € dG P ,° :0 = (T + 7r}. 



Note that 



and 



where 



I u - nds = 0 , 

Jr p,a 

/ unds = uj - / £ x nds = m(p) u; - r a , 

J r+ J r+ 



m(p) = | p 3 + fc 2 



2a 4- 2 



2 q !+3 



3 2a + 3 r 

and r a = (cos cr, sin cr, 0) in the coordinate system £. 
Thus, (6) yields the estimate 

2 



m(p) \oj ■ r a \ < Y] / \u\ds. 
i = o 



( 7 ) 



It is not difficult to see that 
rS 



[ [ \u\dsdp= f \u\d£< [ |u|d£ < [ \ u \ p d£) 1/P ( 8 ) 

Jo JE0 >o . J<3s, a JQs V JQs ' 

for 5 > 0. We used here the Holder inequality and the estimate \Qs\<C <5 3+c \ 

If Q(r,<r,z), j = 1,2, denotes the value of the function \u(r,6,z)\ as 0 = 
a + 7r (j — 1), then 

S 8 p kr 1+a 8 kr 1+a 



0 yd 



II \u\dsdp = J J J Q(r,a, z) dzdrdp < 6 J J (j{r,cr, z) dzdr 

0 0 —k r 1 + Q: 0 —kr 1 ^ a 

8 kr 1+a p _ l (5 kr 1+Q 

< s(^J J dzdr'j P ^ J J rC^ dzdr'j 

o - 

8 kr 1+a 

( / / <3 dZ<lr ) 



0 —k r 1 + a 



0 —k r 1 + Q 
k 1/P 



8 kr 1+a 

p — i / . <-»\ / 

< CS- 

\ 

0 —k r 1 + a 

By integrating this inequality with respect to a from 0 to 27 r, we deduce the 
estimate 

/»27 t p 8 



pZTT po p 

Jo Jo Jui 



\u\dsdpda < C 5^ {a+?,) 



(/ s jur<«) 



1/p 



for j = 1, 2. (9) 
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Therefore, (7), (8) and (9) imply 



n8 /* 27 t 

/ m{p) dp / | 

Jo Jo 



\u) • r a \dcr < C S 



^(cH-3) 






1/p 



Furthermore, by employing the Poincare inequality and the estimate m(p) > | p 3 , 
we obtain 

2ocp — ac — 3 / f \ ^-/P 

u-T^da <C5 p — ( / \Vu\ p d£) , 

K Jco(g 5 ) J 

where co(Qs) is the smallest cylinder containing the domain Qs (its convex hull). 
Since u G and \co(Qs)\ — > 0 as 5 — > 0, we have 




l 



2n 



\oj • r a \ da = 0. 



That is, (jO'Tfj =0 for all a G [0, 2n\. This fact implies that the vector u; is directed 
along the axis £3. 

The theorem is proved. □ 



As it follows from Theorem 1.2, the weak solution of Problem A satisfies the 
energy estimate. This means that u G K(S\Q) for almost all t G [0, T]. Therefore, 
we have to put p = 2 in Theorems 2.1 and 2.2. In this case, the assertions of these 
theorems hold if a > 2/3 and a = 1 for the 2-D and 3-D problems, respectively. 



3. Behavior of the body before the contact 

In this section, we investigate the motion of the body when it is getting to the 
boundary of the flow domain Q. Suppose that the velocity vector field u is rigid 
(. D(u ) = 0) in S(t) for almost all t and S(t) = A4o,t<$o, where <So is the domain 
occupied by the body at some initial time and M s ,t is the family of isometries 
associated to u, defined in Definition 1.1. Therefore, S(t) C for all t. We denote 
by h(t) the distance between S(t) and dQ. 



Theorem 3.1. Let Q, and So be domains in W 1 (n = 2,3) of class C 1,a ; a G (0, 1], 
and So C Q. If u G L°°(0,T; L 2 (Q)) D L 1 (Q,T; K P (S ,tl)) , p G [l,oo], then h is 
Lips chit z- continuous with respect to t and there exists a constant C such that 



dh(t) 

dt 



< <7/i /3 (£)||w(£)||tf.i(s 2 ) 



for almost all t € [0, T } with (3 = 2 — (l + — j;- 



( 10 ) 



Proof We prove the theorem in the case n — 3. The two-dimensional case can be 
managed in the same way. At first, we show that h is a Lipschitz-continuous func- 
tion. As D(u) = 0 in 5, there exist vector fields a and a ; such that u(x,t) = 
aft) + w{t) x [x — #*(£)) as x G S(t). Here, x*(t) is the mass center of 5, 
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a(t) = dx*(t)/dt and u>(£) are the translational and angular velocity of the body 
S(t), respectively. It is not difficult to calculate that 




Since u G L°°(0, T; L 2 (17)), the functions a and u) are bounded. This fact implies 
that the mapping Moj and, as a consequence, the function h(t) are Lipschitz- 
continuous with respect to t. 

Let us turn to the proof of (10). Let P(t) G dS(t) and Q(t) G <912 be points 
such that h(t) = dist (P(t),Q(t)). It may occur that the points P and Q are not 
uniquely defined. In this case, we take an arbitrary pair. 

As in the proof of Theorem 2.2, we introduce a new orthogonal system of 
coordinates £ = (£i, £ 2 , £ 3 ) with the origin at the point Q(t). The axis £3 is directed 
from Q(t) to P(t) and the axes £1 and £2 are tangential to <912. Denote by £ 
the two-dimensional vector (£ 1 , £ 2 )* Since S(t) and 12 are domains of class C 1,a , 
there exist positive constants k and p 0 such that the sets 7 r+ = {£ G IR 3 : |£| < 
Po, h(t)+k |£| 1+c * < £3 < h(t)+k Po +c *} and 7 r_ = {£ G R 3 : |£| < p 0 , -fc|£| 1+ "> 
£3 > — k belong to S(t) and R 3 \ Q, respectively. Let us consider the domain 

Q P = {£ G R 3 : |£| < p, —k |£| 1+Q < £3 < h(t) + k |£| 1+a }. The condition divix = 0 
implies that 

/ u • n ds = 0, 

JdQp 

where n is the outward normal to dQ p . Notice that dQ p = T+ U T~ U r°, where 

v^ = dg p ndi^. 

We consider the function u to be extended by zero to M 3 \ 17. That is, 




u • nds = 0. 



If N is a point on dS ( t ) , then we denote by and by tin the velocity and 
the outward normal to dS(t ) N. Since u is a rigid velocity vector field, it is not 
difficult to calculate that 



Therefore, 



L 



U - nds = 7rp 2 (up 'Tip). 



np 2 \up ' rip\ < 




ds. 



By integrating this inequality with respect to p from 0 to some a G (0,po)> we 
obtain 
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The integral in the right-hand side of (11) can be estimated as follows 

f M < \Go\ \\u\\ LP(ga) <C(h + 2 ka 1+a ) \g a \ ||V«|| L p (n) , 

Jqo 

where C is a constant. Since \Q a \ < C a 2 (h + 2ka lJtOL ), we get 

| Up • np\ < Ca^-p ( h(t ) + 2kcr 1 + a ) 2 p z{t ), 
where z(t) = ||u(£)||iji (n) . 

Let us take a = ch}^ 1Jr0t \ where c is a constant such that the condition 
<7 G (0,po) holds. It is possible to put c = po diam(fl) _I +^ . Thus, 

| up • np\ < C h,P(t) z(i ), (12) 

where 0 = -^(l + |) + 2 - i. 

Let us define the function rj : [0, T] x dQ, —* M such that y = x + r)(t,x)v x G 
dS(t ), where v x is the internal normal to dQ at x. Since dQ and dS(t) are surfaces 
of class C 1+a , the function ry(^, x) is well defined for x belonging to a neighborhood 
U(t) C dQ of the point Q(t). It is not difficult to see that 



dy(t,x) 

dt 



= (u y • n y )(n y • v x ) 



for almost all t G [0, T] and all x G U (£), where n y is the outward normal to dS{t) 
at the point y = x + rj(t , x) v x . In particular, if x is the radius vector of the point 
Q(t\ then n y • v x — and 



dy(t,x) 

dt 



= — Up • Tip. 



(13) 



Let us show that 



dh(t) 

dt 






for almost all t G [0, T\. Indeed, for all t G [0, T] and r > 0, we have 



(14) 



h(t + r) - h(t) = min r](t + r, x) - min r)(t, x) < r)(t + r, Q(t)) - rf(t, Q(t)), 



h(t) - h(t - r) > T){t, Q(t)) - T](t - T, Q(t)). 



Since the functions h and rj are Lipschitz-continuous with respect to t , the last 
two inequalities imply (14). 

Thus, (10) follows from (12), (13), and (14). The theorem is proved. □ 



The theorem we have just proved enables us to investigate the motion of the 
body near the boundary of the flow domain. 



Theorem 3.2. Let the conditions of Theorem 3.1 be satisfied and, besides, let u G 
L q (0,T;K p (S, Q)) withp,q G [l,oo]. The following assertions hold true 

a) if (3 < 1 and h(t *) = 0 for some t* G [0,T], then lim h(t) \t — £*|~ 7 = 0, where 

ry — £nl 
y q 1-0’ 

b) if (3 > 1 and h(t *) > 0 for some t* G [0,T], then h(t) > 0 for all t G [0,T]. 
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Proof. If s,t G [0,T], by integrating (10) from s to t we obtain 

F 0 {h(s))-C 1 1: z(X) </a| < Fp(h{t)) < F 0 (h{s)) +c\J^ z(X)d\ 
where z(t) = ||«(0llffi(n) and 

Fe{h) - log/., „=1. 



(15) 



Assertion (a) of the theorem follows from (15) by applying the Holder inequality 
and the fact that 2 G L q (0,T). 

Assertion (6) is also a simple consequence of (15). Namely, if (3 = 1, then 
h(t) > h(t *) exp ^ — C | z( A) d\\^j 

for all t G [0 , T], and the required assertion follows. The case (3 > 1 is similar. 

□ 

It is interesting to apply Theorem 3.2 to the weak solutions of Problem A. 
These solutions satisfy the energy estimate only. That is, u G L°°(0, T; L 2 (fl)) fl 
L 2 (0, T; K(S, fl)) and p = q = 2. A body gets to a wall with a zero speed if 7 > 1. 
This condition is satisfied if a > \ in the two-dimensional case (n = 2) and if 
a = 1 in the case n — 3. 

Another interesting question is related to the possibility of collisions. Accord- 
ing to Theorem 3.2(b), collisions are impossible if (3 > 1. For a = 1, this will be 
the case if p > n + 1. Thus, strong solutions of Problem A do not admit collisions. 



4. An example of collisions 

Theorem 3.2(a) describes the motion of the body when it collides with the boun- 
dary of the flow domain. But this theorem does not guarantee that collisions occur. 
In this section, we construct an example which shows that collisions are possible. 

We consider the two-dimensional case (n = 2). Our task is to find a vector 
velocity field u that satisfies the energy estimate and solves Problem A with some 
external force field g. 

Suppose that fl = Br( 0) and S(t) = B r (b(t )), r < R, where B p (b) = {x G 
R 2 : \x — b\ < p}, 0 = (0,0) and b(t) = (b(t), 0). The function b(t) is such that 
S(t) C fl for all t , i.e., \b(t)\ < R — r. Sometimes, it will be more convenient 
to use the function aft) = b(t)(R — r) -1 . Therefore, a G [0,1), and a(t) — » 1 
if dist(<S(£), dfl) — > 0. Let x and £ be two coordinate systems in R 2 . Define a 
mapping F(-,t) : fl — > fl ^ as follows 

*1 = ^1 & ,6,0 = 6+ <*(t) (R - y/&+& ) , 

X 2 — F2(ily(,2,t) = £ 2 - 
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It is not difficult to verify that F(Q,t) = ft and F(B r (0),t) = S(t). Besides, if 
£ £ dft, then F(£,t) = £. 

The inverse mapping G looks as follows 

£1 = Gi(x 1 ,x 2 ,t) = ^ (x 1 - aR + a^J x%{\ - a 2 ) + (crR - x^ 2 ) , 

6 = G 2 (xi,x 2 ,t) = x 2 . 

Thus, G(Q,t) = 0, and G(S(t),t) = B r ( 0). 




Figure 3. The mappings F and G. 

Note that we do not consider the mapping F as a Lagrangian flow. It simply 
maps the domain £1 \ S(t) to a more symmetric one. The symmetric domain cor- 
responds to the value of b being equal to zero. In this case £1 and S are concentric 
disks centered at the origin. 

Let us introduce the polar coordinates (p, 0) associated with £ = (^ 1 ,^ 2 )- 
£1 = pcos#, £2 = psin#. The Jacobians of the mappings F and G have a simple 
form in these coordinates: 

J F (Z,t) = det(— J = 1 - v(t)cosd, 

7 I , (dG\ r —i 1 

g x-F(t,t) e ^\ dx ) x=F($,t) F 1 — a(t) cos 9 

Now we want to define the stream function ^(x,t) of a flow which would 
correspond to the considered situation. Namely, the velocity u of the fluid should 
be such that 

u — 0 on dfi, and D(u) =0 in S. (16) 

Let A : [0, 00 ) — > [0, 1] be a smooth function such that A(p) = 1 for p < r and 
A'(r) = A (R) = A'(jR) = 0. It is possible to take 

A (p) = (R- r)~ 3 (p - R) 2 (2p -3 r + R) 

If, in particular, R = 3 and r = 1, then A (p) = \p(p — 3) 2 . 

Define the stream function as follows 

4>(x,t) = b(t) x 2 p(G(x,t)). 
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where (p(£) = A(y^ 2 + f 2 ) an d & ~ db/dt. The level lines of the function if are 
schematically shown in Fig. 4. 






a = 0 cf — 0.6 (T — 0.9 

Figure 4. The level lines of the stream function if for various 
values of a. 



The corresponding velocity vector field u(x,t) = V^(x,£) has the compo- 
nents 



ui{x,t) = = b(t)<p(G(x,t)) +b(t)x 2 G X2 • V{¥?(Ol€=G(x,t). 

d*ip 

u 2 (x,t) = = -b(t)x 2 G Xl ■ V^(£)| 4=G(Xit ). 



(17) 



Note that u satisfies (16). 

We have to check that u G L°°(0, T; L 2 (0)) fl L 2 (0,T; i/ 1 (fl)). The norm of 
u in L 2 (fl) can be calculated explicitly: 

ll«lli* ( n) = ll u illi 2 (Q) + ll«2||ia(n) = in(R,r)v 1 (a) |d| 2 , 

where pi(R,r) = f^p 3 (A'(p)) 2 dp < oo and 



, N f 2n sin 2 0 

^i(^) = / i 7 d6 = 27r 
J Q 1 — <7 COS 0 



1- VT^ 



27T • 



1 



/Q ±-^COS0 (J 2 1 + y/l - a 2 

It is not difficult to see that ^i(cr) G [7r,27r]. Thus, the following assertions holds 
true. 



Lemma 4.1. The function u belongs to the space L°°(0, T; L 2 (f2)) if and only if 
& G Z/°°(0, T). 

In order to prove that u G L 2 (0,T; IT 1 (fl)), it is enough to estimate in 
L 2 (£It)- After some computations, we obtain 

\\^\\h(Q) < P 2 {R,r)v 2 {<r) |d| 2 , 

where 1 / 2 (^) = (1 — a) -3 / 2 and /i 2 is a bounded positive constant depending on R 
and r. This implies the following lemma. 

Lemma 4.2. The function u belongs to the space L 2 (0, T; H 1 ^)) if and only if 
(1 — <j) _3 / 2 |cr| 2 G L 1 (0,T). 
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Notice that the distance between the body and the boundary reads as h(t ) = 
(R — r)(l — a(t)) and the condition in Lemma 4.2 has the form h~ 3 ^ 4 \h\ G L 2 (0,T). 
This condition is in accordance with (10). Therefore, the estimate in Theorem 2.2 
is precise. 

It is not difficult to find a function a(t) which is equal to zero at some 
£* G [0,T] and satisfies the conditions of Lemmas 4.1 and 4.2. Really, let us take 
T = 1 and a(t) = 1 — (t — t *) 4 . Then & and (1 — <t) -3 / 2 |<t| 2 are bounded. Thus, 
substituting the present function a into (17) gives an example of a vector velocity 
field that satisfies the energy estimate and admits collisions of the body with the 
boundary. 

However, we cannot say yet that u is a weak solution of Problem A. Let us 
consider the time derivative of u. There exists a positive constant /^(i?, r) such 
that 

HV’tlU 2 ^) < V3(R,r)(\a\ + |<r| 2 ). 

This estimate yields the following lemma. 

Lemma 4.3. If a £ L 2 (0,T), then u t G L 2 (0, T; H~ 1 (Q)). 

Notice that a(t) = 1 — (£ — £*) 4 satisfies the condition of this lemma. By 
substituting the function u corresponding to this a into (3), we easily deduce the 
result below. 

Theorem 4.4. If aft) is a function such that or G L 2 (0,T) and (1 — cr) -3 / 2 |<j| 2 G 
L 1 (0,T), then there exists an external force vector field g G L 2 (0, T; if _1 (fl)) such 
that the velocity vector field u given by (17) is a weak solution of Problem A. 
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Crystal Growth, Coarsening and the 
Convective Cahn-Hilliard Equation 

Stephen J. Watson 



Abstract. The coarsening dynamics of a faceted vicinal crystalline surface 
growing into its melt by attachment kinetics is considered. The convective 
Cahn-Hilliard equation ( CCTL ) is derived as a small amplitude expansion of 
such surface evolutions restricted to 1-D morphologies, with the local surface 
slope serving as the order parameter. A summary of the sharp interface theory 
for CCH, that follows from a matched asymptotic analysis, is also presented 
[26]. It takes the form of a nearest neighbor interaction between two non- 
symmetrically related phase boundaries ( kink and anti-kink). The resulting 
coarsening dynamical system CVS for the phase boundaries exhibits novel 
coarsening mechanisms. In particular, binary coalescence of phase boundaries 
is impossible. Also, ternary coalescence occurs only through two kinks meet- 
ing an anti- kink resulting in a kink ( kink-ternary ); the alternative of two anti- 
kinks meeting a kink is impossible. This behavior stands in marked contrast 
to the Cahn-Hilliard equation C7i where binary coalescence of phase bound- 
aries is generic. Numerical simulations of CCH are presented to validate the 
predictions of the sharp interface theory CVS. 

[1] S. J. Watson, F. Otto, S. H. Davis, Coarsening dynamics of the convective 
Cahn-Hilliard equation , Physica D 178 (3-4) (2003) 127-148. 



1. Introduction 

First-order phase transformation under local thermodynamic equilibrium results in 
phases which are identified through minimization of appropriate thermodynamic 
potentials; e.g. the Maxwell rule for a vapor-liquid mixture. However, if equilibrium 
is not present then kinetic effects may present themselves, resulting in new phase 
selection rules; e.g., solute trapping during rapid solidification of a binary alloy. 

The phase ordering of an initially homogeneous unstable state will generally 
proceed in two distinct stages: 

Spinodal decomposition: A primary instability, such as surface tension, will 
drive segregation (nucleation) into a sub-optimal fine-grained structure of 
pure phases separated by diffuse but narrow phase boundaries. 

Coarsening: The morphological (characteristic) length scale Cm{ t) grows in 
time as coalescing phase boundaries annihilate. 
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We use the term coarsening dynamics of a phase ordering process in a broad 
sense which includes, i) the detailed study of the temporal evolution of the phase 
boundaries, ii) the mechanisms by which coarsening occurs, as well as iii) gross 
features like the temporal change in the morphological length scale Cm(^)- The 
coarsening dynamics of a non-equilibrium ( driven ) phase ordering system can be 
profoundly different from its equilibrium counterpart, as we shall see. 

A thermodynamically unstable crystal surface with strongly anisotropic sur- 
face energy in contact with its melt will spinodally decompose into a faceted surface 
[5, 22, 17] ; here the local normal to the surface serves as the phase variable ( order 
parameter). Now, the coarsening dynamics of a faceted crystal surface in thermal 
equilibrium with its melt ( thermally annealed surface) is known experimentally to 
change dramatically if, instead, the crystal is subject to net growth [20, 11, 21]. 
In particular, growth accelerates the rate at which the morphological length scale 
of the surface, Cm , increases. In addition, 2D surface morphologies are radically 
altered [20, 18, 11]. We coin the term annealing-to- growth transition in reference 
to the above effects. 

We consider the annealing-to- growth transition in the setting where attach- 
ment kinetics is the dominant mass transfer mechanism, and restrict our attention 
to one-dimensional surface morphologies (hill/valley structures). We derive the 
convective Cahn-Hilliard equation ( CCH ) via a small amplitude expansion of the 
physical model associated with net growth of the faceted surface. In scaled form 
(CCH) reads, 

qt -eqq x = (W'(q) ~q xx ) xx , (1) 

where the local surface slope q(x,t) serves as the order parameter, subscripts 
denote the partial derivative with respect to time t and space x respectively, and 
' denotes the ^-derivative. The effective free energy W(q) takes the form of a 
symmetric double well with minima at q = ±1, thereby capturing the anisotropy 
of the crystal surface energy. Also, the dimensionless small parameter e multiplying 
the convective term qqx is a measure of the growth strength. 

In the absence of net growth, e = 0, eqn. (1) reduces to the well studied 
Cahn-Hilliard equation (CH) 

qt = (W'(q)-q xx ) xx . ( 2 ) 

This equation originally arose in modelling the spinodal decomposition of binary 
alloys under isothermal conditions [4]; there q is related to the phase fraction. 
Here it serves as a model for the evolution of a thermally annealed 1-D faceted 
(hill/valley) crystal surface [22, 17]. Among the properties of the CH theory are 
the order parameter (phase) selection q — d=l through the bi-tangent construction 
on W (q) [3] ; a thermodynamic equilibrium construction. The coarsening is driven 
by the non-local free energy [15] 

j- ( W(q ) + ^ q x 2 ) dx, 

and exhibits the following features [14, 1, 6, 23]: 
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Figure 1. Schematic representation of the growing faceted sur- 
face; the facet angles have been exaggerated 



• Binary coalescence of phase boundaries is the generic coarsening event. 

• Cm (t) ~ In t 

The coarsening dynamics of CH serves as the archetypal example of a thermody- 
namic equilibrium phase ordering process [2]. 

Remark 1.1. The bi-tangent construction on W(q ) may be viewed as the “small 
slope” version of the Wulff construction on the surface energy 7 (6), which deter- 
mines thermodynamically compatible facet orientations [7] . 

The coarsening dynamics of CCH (0 < e < 1) 1 stands in marked con- 
trast with its equilibrium counterpart CH. This is an expression of the the ex- 
perimentally observed annealing-to- growth transition alluded to earlier. First, the 
bi-tangent construction for phase selection is broken by CCH [16]. This departure 
from thermodynamic equilibrium marks CCH as a driven phase ordering system 
[10]. Two non-symmetrically related phase boundaries consequently emerge, which 
we refer to as kink (hill corner) and anti-kink (valley corner); see figs. 1 and 2. 
In turn, the kinks and anti-kinks interact through a convection-diffusion mecha- 
nism yielding, to leading order, a nearest neighbors Coarsening Dynamical System 
(CVS) representation for the phase ordering process [26]. The CVS theory pre- 
dicts the following surprising and novel features for the coarsening dynamics of 
CCH [26]: 

i. Cm ~ t 1 / 2 provided 1 <C Cm < \ 

ii. Binary coalescence of phase boundaries is impossible. 

iii. Ternary coalescence of phase boundaries may only occur through a kink- 
ternary; two kinks meet an anti-kink resulting in a kink. 

Furthermore, a crossover from power law to logarithmically slow coarsening occurs 
as Cm crosses the (dimensionless) Peclet length scale Cp ~ ^ [26] . All of the above 

x See Remark 1.2 
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Figure 2. Structure of the matched asymptotic composite solu- 
tion of a periodic profile for CCH [26]. Here, • and o denote the 
location of the kink and the anti-kink, respectively. 



are confirmed by direct numerical simulations on CCH, which have been carried 
out in [26, 12]. 



Remark 1.2. When e ~ 1, the morphologies of solutions do not coarsen in time, 
but rather display periodic patterns, while for e 1 solutions become rough [12]. 
This is related to the fact that formally the convective Cahn-Hilliard approaches 
the Kuramoto-Sivashinsky (KS) equation as e — > oo, and solutions of the (KS) 
equation are known to display spatio-temporal chaos . 



The remainder of this paper is structured as follows. First, in Section 2 we 
describe the geometrically exact equation of motion for attachment kinetics con- 
trolled crystal growth. A small amplitude expansion of this equation about a vicinal 
surface is then presented in Sec. 3, from which we derive the convective Cahn- 
Hilliard equation {CCH). In Section 4 we present a synopsis of the Coarsening 
Dynamical System CVS associated with CCH. We conclude in Sec. 5 with direct 
numerical simulations of CCH and CVS. 
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Figure 3. Sign conventions for a liquid/solid interface X 



2. Attachment kinetics controlled crystal growth 

Here we describe a physical model for the growth of a 2D crystal with anisotropic 
surface energy, where attachment kinetics [8] is the dominant mass transport mech- 
anism. The resulting geometric evolution equation for the crystal surface [9, 13] 
takes the form of a ( driven ) anisotropic Willmore flow [27]. 

Let C C R 2 denote the boundary between a crystal and its melt. We orient 
the curve as in fig. 2 and let 6 denote the (oriented) polar angle of the local tangent 
plane, s the arc length, and n = the (signed) curvature. We consider the surface 
energy S of X to take the form [9] 

s = £ J (7(0) + ^a 2 « 2 ) ds ’ ( 3 ) 



where 7 : [0, 2tt] — > R is a given dimensionless function of the local angle 0, while 
£ and a are dimensional constants which set scales of energy per unit length and 
length, respectively. Surface energy anisotropy is captured by the dependence of 7 
on the local surface orientation 6. The curvature term models an edge energy which 
is concentrated in the region where two facets meet, with a setting the length scale 
of the associated rounded edge. It is argued that such edge energies arise from the 
presence of crystalline steps on the surface, and the constant a is related to the 
lattice constant d [24, 19]. 

The surface chemical potential IS defined by the variational derivative of 
<S; namely, 

a 2 ( k s 





( 4 ) 
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where the surface stiffness s(6) := 7 (6) + 7 X, (0). Note that ps, which has the 
dimensions of energy per unit area, expresses solely the local gain or loss of surface 
energy as the interface evolves. 

A crystal surface with orientation 6q is thermodynamically unstable if s(6o) < 
0, and stable if s(9o) > 0. Unstable orientations are missing in the equilibrium 
crystal shape (Wulff shape). Also, a pair of stable facets which meet are globally 
stable if the corresponding facet angles satisfy the Wulff construction [7]. 

We neglect the coupling of the surface evolution to the dissipation of heat 
by imposing constant bulk chemical potentials at the solid and melt sides of the 
interface; this assumption is valid, for example, in the limit of infinite heat con- 
ductivity. Furthermore, we assume that the mass density of the crystal and melt 
coincide, so that there is no induced fluid flow during solidification and melting. 
Now, on a point of terminology, bulk chemical potentials are usually given in units 
of energy per unit mass. Here, because of our uniform density assumption, we may 
conveniently take /i c and p m to denote the energy per unit area of the bulk crystal 
and its melt respectively. 

The interfacial chemical potential fix, which accounts for changes in both 
surface and bulk energies upon variation of the interface X, is given by 

PX ~ f^S U Pc Pm • (5) 

We assume that isotropic attachment kinetics (solidification/melting) is the dom- 
inant mass transfer mechanism, i.e., 

BV n = -p x , (6) 

where V n is the normal velocity of the interface, and B > 0 is the (constant) kinetic 
coefficient with dimensions ([B] = MT~~ l L~ l ). Taking all of the above together, 
we arrive at the following geometric evolution equation for the interface [9, 13]: 

BV n =S [(7(0) + lee{0)) k - a 2 ( k ss + k 3 )] + {m - Us) (7) 

Note that the planar front 0 = 0 moving with normal velocity V n = 
is a solution to (6). Switching to a frame of reference co- moving with this front, 
the evolution equation for the interface takes the form 

BV n =£ [(7(0) +7^(0)) k - a 2 (k ss + /c 3 )] + (w - Ms)(l - cos0) (8) 

3. Long-wave approximation 

We assume that 6 = 0 is a vicinal surface, with a surface stiffness for neighboring 
orientations of the specific form 2 

s<#)=3 (T) <9) 

where 0 < r/ < 1. This guarantees that the planar front 6 = 0 is thermodynami- 
cally unstable, and, furthermore, that there is a globally stable pair of facets with 

2 see remark 3.1 
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6 ~ ±rj. In addition, we assume that the surface energy scale £ appearing in (3) 
scales with rj: 

£ = t]E 0 , (10) 

where Eo > 0. 

Remark 3.1. The term vicinal refers to an unstable surface that is in the vicinity 
of stable orientations. Our specific parabolic choice of surface stiffness is simply 
for convenience, but could be replaced by a general function with similar structure. 

We now proceed to derive the convective Cahn-Hilliard CCTL as a small am- 
plitude expansion of equation (8), subject to the constraints on the surface energy 
given by (9) and (10). In this sense, CCTL serves as a model for the spinodal de- 
composition and subsequent coarsening of a growing faceted surface. 

First, we re-scale (8) with respect to the length scale a , and the time scale 
i.e., s — > ^ and t — > ^t. We arrive at the dimensionless form 

V n = s(0)k - ( K ss + ^K 3 ) +£(1 - COS0), (11) 



where the dimensionless parameter 



^(/^m He) 



is a measure of the growth strength. 

Consider now the evolution of a free surface in the vicinity of the unstable 
planar front 0 = 0. Letting y = h(x , t) be the graph of the free surface, we assume 
0(1) variation in x, 

D~0(1). (13) 

together with the small amplitude expansion 

h = rjhi + r] 2 h 2 H . (14) 

Note that (13) and (14) together imply the small slope condition 

h x = 0(rj). (15) 

It now follows that the normal velocity V n has the expansion 

Vn = v ^t +0 ^' 

Also, arc-length derivatives coincide with x-derivatives to leading order: 

In addition, the angle 6 and the local slope h x coincide to leading order, 

6 = h x + 0(rj 3 ). 

Since k — d6/ds , we also conclude 

K = + 0( r } 2 ), K ss = +0(r) 2 ), K 3 = 0(r] 3 ). 
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Last, we find 



(1 — cos6) = T) 2 



dhi 

dx 



2 + 0(r? 3 ), 



and, upon recalling (9), 



We expand (11) in powers of rj, and then gather the (leading) terms of order 
0(77), which yields 



dh\ 1 / dhi\ 2 „ f dh\ \ d 2 h\ d 4 h\ 

dt £ 2 V dx ) S l dx ) dx 2 dx 4 



(16) 



where 



{l^m He)® 

£ ~ ~Eo ' 

Now, differentiating (16) with respect to x and setting q — h\ x (the re-scaled local 
slope), we arrive at the convective Cahn-Hilliard equation {CCH) 



q t - eqq x = ( W'(q ) - q xx ) xx ■ 

Here, for the particular choice of s given by (9), the effective free energy is 
W{q) ~JJ s(q)dq 2 = p 4 - \q 2 . 



(17) 



Remark 3.2. For a thermally annealed surface we have fi m = ii c , in which case 
5 = 0 and (17) reduces to the Cahn-Hilliard equation (CFL). 



4. The coarsening dynamical system CVS 

The coarsening dynamics of CCH (0 < £ « 1) and CH (s = 0) stand in marked 
contrast with one another. The origin of this distinction rests in the driven (non- 
equilibrium) nature of the phase ordering of CCH. We summarize here the key 
morphological features of the coarsening theory for CCH ; a complete treatment 
appears in [26]. 

First, the presence of convection induces two non-symmetrically related phase 
boundaries ( kink and anti-kink ) [16]. In turn, the kinks and the anti-kinks inter- 
act with their nearest neighbors through a convective-diffusive mechanism which 
arises from their associated non-equilibrium offsets. Fig. 2 shows a uniformly valid 
matched asymptotic expansion of a periodic (though unstable) profile for CCH. It 
clearly exhibits the kink/anti-kink asymmetry, as well as the convective-diffusive 
balance which occurs in the outer region between them. 

Definition 4.1 (Index convention). Given an alternating array of kink/ anti-kink 
locations on the line, we adopt the (arbitrary) ordering convention of letting kinks 
have odd indices, which in turn gives anti-kinks even indices. 
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u 



— $ • e— 

Xi-\ Xi(t) x i + 1 



Figure 4. Here Xi ( t ) denotes the location of the z’th phase 
boundary, Lift) := x*+i — X{ is the distance between the i’th 
and i -f l’th phase boundary, and • and o denote kink and anti- 
kink, respectively; given our indexing convention , in this figure i 
is understood to be odd. 



Let Xi(t) denote the location of the i’th phase boundary at time t , and set 
Li(t) Xi+i —Xi to be the distance between the i’th and i + l’th phase boundary. 
A matched asymptotic analysis [26] shows that the motion of phase boundaries is 
governed, to leading order, by the nearest neighbor dynamical system 



dxi . >. i+i 

~dt } 



J (Li) — J (Li 



where the (effective) flux function J(C) is given by 

£ 1 



J(C) := 



2y/2 exp (C/2) - 1 



(18) 



(19) 



This theory is valid for all morphological (characteristic) length scales Cm 1; 
i.e., whenever the system enters the interfacial regime. 



Remark 4.2. The skew-symmetry between the evolution equations for kink and anti- 
kink reflects the broken mirror symmetry x — ► — x of the convective Cahn-Hilliard 
equation. 

The dynamical system (18) coarsens in time as kinks and anti-kinks meet and 
then annihilate one another. By coarsening, we mean that the average separation 
between phase boundaries grows in time. We refer to the dynamical system (18) 
subject to such coarsening events as CVS ( coarsening dynamical system). 



5. Numerical simulation of CVS and CCH 

A direct numerical simulation of CCH with e — 0.1 is shown in fig. 5; a pseudo- 
spectral explicit in time method on a periodic domain. The initial condition in- 
volved a small random (Gaussian) perturbation of the unstable homogeneous state 
q = 0. One initially observes spinodal decomposition with a wavelength consistent 
with the dominant linear instability of the system. An interfacial regime subse- 
quently emerges, and coarsening then proceeds solely via a unique type of ternary 
event whereby two kinks meet an anti-kink resulting in a kink; the kink-ternary. On 
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Figure 5. Simulation of the CCH evolution equation for e — 0.1. 
The color scheme is gray for q = 1 through white, q = 0, to black 
for q — — 1; courtesy A. A. Golovin. 



x 10 
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Anti-kink 




Figure 6. The initial location of the kinks • and anti-kinks o 
for the comparative study of CCH and CVS\ here denotes the 
location of the kink prior to the small offset 77 from periodicity in 
the most unstable direction [26]. 




Figure 7. Numerical simulation of the CCH and CVS on a perio- 
dic domain with initial data as in fig. 6; courtesy of Scott Norris. 
The space scale (horizontal axis) and time scale (vertical axis) are 
left unspecified since the match displayed is generic. 



the other hand, binary coalescence of phase boundaries is the generic coarsening 
event of the Cahn-Hilliard evolution [14, 6, 23]. 

Now, consider the simple case of a periodic domain of four equi-spaced phase 
boundaries perturbed in the manner exhibited in fig. 6. The direct numerical simu- 
lations of CCH and CVS are presented for comparison in fig 7. One notes excellent 
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agreement up to the point where the phase boundaries meet. Here, the finite width 
of phase boundaries plays a role that is not captured by our sharp interface theory. 
However, the time scale over which they differ markedly is only a small fraction of 
the phase boundary lifetimes. Whence, we expect the coarsening rate of CVS to 
faithfully represent that of CCTL. 

Remark 5.1. A linear stability analysis of an equi- spaced kink/ anti-kink array has 
a dominant instability in which kinks are pairwise pinched toward the intervening 
anti-kinks, in the manner exhibited in Fig. 6. This justifies the choice of initial 
data for the comparative study of CVS and CCTL shown in Fig. 1. 
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